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A NOVEL HUMAN G-PROTEIN COUPLED RECEPTOR, HGPRBMY4, 
EXPRESSED HIGHLY IN PROSTATE, COLON, AND LUNG 

FIELD OF THE INVENTION 

[0001] The present invention relates to the fields of pharmacogenomics, 

diagnostics, and patient therapy. More specifically, the present invention relates to 
methods of diagnosing and/ or treating diseases involving the Human G-Protein 
Coupled Receptor, HGPRBMY4. 

BACKGROUND OF THE INVENTION 

[0002] It is well established that many medically significant biological 

processes are mediated by proteins participating in signal transduction pathways that 
involve G-proteins and/ or second messengers, e.g., cAMP (Lefkowitz, Nature , 
351:353-354 (1991)). Herein these proteins are referred to as proteins participating in 
pathways with G-proteins or PPG proteins. Some examples of these proteins include 
the GPC receptors, such as those for adrenergic agents and dopamine (Kobilka, B. K., 
et al., PNAS . 84:46-50 (1987); Kobilka, B. K., et al., Science. 238:650-656 (1987); 
Bunzow, J. R., et al., Nature . 336:783-787 (1988)), G-proteins themselves, effector 
proteins, e.g., phospholipase C, adenylate cyclase, and phosphodiesterase, and actuator 
proteins, e.g., protein kinase A and protein kinase C (Simon, M. L, et al., Science , 
252:802-8 (1991)). 

[0003] For example, in one form of signal transduction, the effect of hormone 

binding is activation of an enzyme, adenylate cyclase, inside the cell. Enzyme 
activation by hormones is dependent on the presence of the nucleotide GTP, and GTP 
also influences hormone binding. A G-protein connects the hormone receptors to 
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adenylate cyclase. G-protein was shown to exchange GTP for bound GDP when 
activated by hormone receptors. The GTP-carrying form then binds to an activated 
adenylate cyclase. Hydrolysis of GTP to GDP, catalyzed by the G-protein itself, 
returns the G-protein to its basal, inactive form. Thus, the G-protein serves a dual role, 
as an intermediate that relays the signal from receptor to effector, and as a clock that 
controls the duration of the signal. 

[0004] The membrane protein gene superfamily of G-protein coupled receptors 

has been characterized as having seven putative transmembrane domains. The 
domains are believed to represent transmembrane a-helices connected by extracellular 
or cytoplasmic loops. G-protein coupled receptors include a wide range of 
biologically active receptors, such as hormone, viral, growth factor and 
neuroreceptors. 

[0005] G-protein coupled receptors have been characterized as including these 

seven conserved hydrophobic stretches of about 20 to 30 amino acids, connecting at 
least eight divergent hydrophilic loops. The G-protein family of coupled receptors 
includes dopamine receptors, which bind to neuroleptic drugs, used for treating 
psychotic and neurological disorders. Other examples of members of this family 
include calcitonin, adrenergic, endothelin, cAMP, adenosine, muscarinic, 
acetylcholine, serotonin, histamine, thrombin, kinin, follicle stimulating hormone, 
opsins, endothelial differentiation gene-1 receptor, rhodopsins, odorant, 
cytomegalovirus receptors, etc. 

[0006] Most G-protein coupled receptors have single conserved cysteine 

residues in each of the first two extracellular loops which form disulfide bonds that are 
believed to stabilize functional protein structure. The 7 transmembrane regions are 
designated as TM1, TM2, TM3, TM4, TM5, TM6, and TM7. TM3 has been 
implicated in signal transduction. 

[0007] Phosphorylation and lipidation (palmitylation or farnesylation) of 

cysteine residues can influence signal transduction of some G-protein coupled 
receptors. Most G-protein coupled receptors contain potential phosphorylation sites 

EXPRESS MAIL LABEL NO.: EL 912004928 US 

646785 vl 



-3- 



within the third cytoplasmic loop and/ or the carboxyl terminus. For several G-protein 
coupled receptors, such as the p-adrenoreceptor, phosphorylation by protein kinase A 
and/ or specific receptor kinases mediates receptor desensitization. 
[0008] For some receptors, the ligand binding sites of G-protein coupled 

receptors are believed to comprise a hydrophilic socket formed by several G-protein 
coupled receptors transmembrane domains, which socket is surrounded by 
hydrophobic residues of the G-protein coupled receptors. The hydrophilic side of each 
G-protein coupled receptor transmembrane helix is postulated to face inward and form 
the polar ligand-binding site. TM3 has been implicated in several G-protein coupled 
receptors as having a ligand-binding site, such as including the TM3 aspartate residue. 
Additionally, TM5 serines, a TM6 asparagine and TM6 or TM7 phenylalanines or 
tyrosines are also implicated in ligand binding. 

[0009] G-protein coupled receptors can be intracellularly coupled by 

heterotrimeric G-proteins to various intracellular enzymes, ion channels and 
transporters (see, Johnson et al., Endoc. Rev .. 10:317-331(1989)). Different G-protein 
P-subunits preferentially stimulate particular effectors to modulate various biological 
functions in a cell. Phosphorylation of cytoplasmic residues of G-protein coupled 
receptors have been identified as an important mechanism for the regulation of G- 
protein coupling of some G-protein coupled receptors. G-protein coupled receptors 
are found in numerous sites within a mammalian host. 

[0010] G-protein coupled receptors (GPCRs) are one of the largest receptor 

superfamilies known. These receptors are biologically important and malfunction of 
these receptors results in diseases such as Alzheimer's, Parkinson, diabetes, dwarfism, 
color blindness, retinal pigmentosa and asthma. GPCRs are also involved in 
depression, schizophrenia, sleeplessness, hypertension, anxiety, stress, renal failure 
and in several other cardiovascular, metabolic, neuronal, oncology and immune 
disorders (F. Horn and G. Vriend, J. Mol. Med .. 76: 464-468 (1998)). They have also 
been shown to play a role in HIV infection (Y. Feng et al., Science . 272: 872-877 
(1996)). The structure of GPCRs consists of seven transmembrane helices that are 
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connected by loops. The N-terminus is always extracellular and C-terminus is 
intracellular. GPCRs are involved in signal transduction. The signal is received at the 
extracellular N-terminus side. The signal can be an endogenous ligand, a chemical 
moiety, or light. This signal is then transduced through the membrane to the cytosolic 
side where a heterotrimeric protein G-protein is activated which in turn elicits a 
response (F. Horn et al., Recept. and Charm .. 5: 305-314 (1998)). Ligands, agonists 
and antagonists, for these GPCRs are used for therapeutic purposes. 
[001 1] The present invention provides a newly discovered G-protein coupled 

receptor protein, which may be involved in cellular growth properties in the prostate, 
colon, lung, and heart based on its abundance in prostate, colon, lung, and heart 
tissues. The present invention also relates to newly identified polynucleotides, 
polypeptides encoded by such polynucleotides, the use of such polynucleotides and 
polypeptides, as well as the production of such polynucleotides and polypeptides. 
More particularly, the polypeptides of the present invention are human 7- 
transmembrane receptors. In addition, the invention also relates to inhibiting the 
action of such polypeptides. 

SUMMARY OF THE INVENTION 

[0012] The present invention provides a novel human member of the GPCR 

family (HGPRBMY4). Based on sequence homology, the protein HGPRBMY4 is a 
candidate GPCR. This protein sequence has been predicted to contain seven 
transmembrane domains, which is a characteristic structural feature of GPCRs. This 
orphan GPCR is expressed highly in prostate, colon, and lung with moderate 
expression in the heart. 

[0013] The present invention provides an isolated HGPRBMY4 polynucleotide 

as depicted in SEQ ID NO:l (CDS: 1 to 221 1). 

[0014] The present invention also provides the HGPRBMY4 polypeptide 

(MW: 35.4Kd), encoded by the polynucleotide of SEQ ID NO:l and having the amino 
acid sequence of SEQ ID NO:2, or a functional or biologically active portion thereof. 
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[0015] The present invention further provides compositions comprising the 

HGPRBMY4 polynucleotide sequence, or a fragment thereof, or the encoded 
HGPRBMY4 polypeptide, or a fragment or portion thereof. Also provided by the 
present invention are pharmaceutical compositions comprising at least one 
HGPRBMY4 polypeptide, or a functional portion thereof, wherein the compositions 
further comprise a pharmaceutically acceptable carrier, excipient, or diluent. 
[0016] The present invention provides a novel isolated and substantially 

purified polynucleotide that encodes the GPCR homologue. In a particular aspect, the 
polynucleotide comprises the nucleotide sequence of SEQ ID NO: 1 . The present 
invention also provides a polynucleotide sequence comprising the complement of SEQ 
ID NO: 1, or variants thereof. In addition, the present invention features 
polynucleotide sequences, which hybridize under moderately stringent or high 
stringency conditions to the polynucleotide sequence of SEQ ID NO:l. 
[0017] The present invention further provides a nucleic acid sequence 

encoding the HGPRBMY4 polypeptide and an antisense of the nucleic acid sequence, 
as well as oligonucleotides, fragments, or portions of the nucleic acid molecule or 
antisense molecule. Also provided are expression vectors and host cells comprising 
polynucleotides that encode the HGPRBMY4 polypeptide. 

[0018] The present invention provides methods for producing a polypeptide 

comprising the amino acid sequence depicted in SEQ ID NO:2, or a fragment thereof, 
comprising the steps of a) cultivating a host cell containing an expression vector 
containing at least a functional fragment of the polynucleotide sequence encoding the 
HGPRBMY4 homologue according to this invention under conditions suitable for the 
expression of the polynucleotide; and b) recovering the polypeptide from the host cell. 
[0019] Also provided are antibodies, and binding fragments thereof, which 

bind specifically to the HGPRBMY4 polypeptide, or an epitope thereof, for use as 
therapeutics and diagnostic agents. 

[0020] The present invention also provides methods for screening for agents 

which modulate HGPRBMY4 polypeptide, e.g., agonists and antagonists, as well as 
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modulators, e.g., agonists and antagonists, particularly those that are obtained from the 
screening methods described. 

[0021] Also provided by the present invention is a substantially purified 

antagonist or inhibitor of the polypeptide of SEQ ID NO:2. In this regard, and by way 
of example, a purified antibody that binds to a polypeptide comprising the amino acid 
sequence of SEQ ID NO:2 is provided. 

[0022] Substantially purified agonists of the G-protein coupled receptor 

polypeptide of SEQ ID NO:2 are further provided. 

[0023] The present invention provides HGPRBMY4 nucleic acid sequences, 

polypeptide, peptides and antibodies for use in the diagnosis and/ or screening of 
disorders or diseases associated with expression of the polynucleotide and its encoded 
polypeptide as described herein. 

[0024] The present invention provides kits for screening and diagnosis of 

disorders associated with aberrant or uncontrolled cellular development and with the 
expression of the polynucleotide and its encoded polypeptide as described herein. 
[0025] The present invention further provides methods for the treatment or 

prevention of cancers, immune disorders, neurological, or prostate-, colon-, lung- or 
cardiovascular- related disorders involving administering, to an individual in need of 
treatment or prevention, an effective amount of a purified antagonist of the 
HGPRBMY4 polypeptide. Due to its elevated levels of expression in specific tissues, 
the novel GPCR protein of the present invention is particularly useful in treating or 
preventing prostate-, colon-, lung-, and/or cardiovascular- related disorders, 
conditions, or diseases. 

[0026] The present invention also provides a method for detecting a 

polynucleotide that encodes the HGPRBMY4 polypeptide in a biological sample 
comprising the steps of: a) hybridizing the complement of the polynucleotide sequence 
encoding SEQ ID NO:2 to a nucleic acid material of a biological sample, thereby 
forming a hybridization complex; and b) detecting the hybridization complex, wherein 
the presence of the complex correlates with the presence of a polynucleotide encoding 
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the HGPRBMY4 polypeptide in the biological sample. The nucleic acid material may 
be further amplified by the polymerase chain reaction prior to hybridization. 
[0027] Further objects, features, and advantages of the present invention will 

be better understood upon a reading of the detailed description of the invention when 
considered in connection with the accompanying figures/drawings. 
[0028] One aspect of the instant invention comprises methods and 

compositions to detect and diagnose alterations in the HGPRBMY4 sequence in 
tissues and cells as they relate to ligand response. 

[0029] The present invention further provides compositions for diagnosing 

prostate-, colon-, lung-, and/or cardiovascular- related disorders and response to 
HGPRBMY4 therapy in humans. In accordance with the invention, the compositions 
detect an alteration of the normal or wild type HGPRBMY4 sequence or its expression 
product in a patient sample of cells or tissue. 

[0030] The present invention further provides diagnostic probes for diseases 

and a patient's response to therapy. The probe sequence comprises the HGPRBMY4 
locus polymorphism. The probes can be constructed of nucleic acids or amino acids. 
[0031] The present invention further provides antibodies that recognize and 

bind to the HGPRBMY4 protein. Such antibodies can be either polyclonal or 
monoclonal. Antibodies that bind to the HGPRBMY4 protein can be utilized in a 
variety of diagnostic and prognostic formats and therapeutic methods. 
[0032] The present invention also provides diagnostic kits for the 

determination of the nucleotide sequence of human HGPRBMY4 alleles. The kits are 
based on amplification-based assays, nucleic acid probe assays, protein nucleic acid 
probe assays, antibody assays or any combination thereof. 
[0033] The instant invention also provides methods for detecting genetic 

predisposition, susceptibility and response to therapy related to the prostate, colon, 
lung, and heart. In accordance with the invention, the method comprises isolating a 
human sample, for example, blood or tissue from adults, children, embryos or fetuses, 
and detecting at least one alteration in the wild-type HGPRBMY4 sequence or its 
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expression product from the sample, wherein the alterations are indicative of genetic 
predisposition, susceptibility or altered response to therapy related to the prostate, 
colon, lung and heart. 

[0034] In addition, methods for making determinations as to which drug to 

administer, dosages, duration of treatment and the like are provided. 

BRIEF DESCRIPTION OF THE FIGURES 

[0035] Figure 1 shows the full length nucleotide sequence of cDNA clone 

HGPRBMY4, a human G-protein coupled receptor (SEQ ED NO:l). 

[0036] Figure 2 shows the amino acid sequence (SEQ ED NO:2) from the 

conceptual translation of the full length HGPRBMY4 cDNA sequence. 

[0037] Figure 3 shows the 5' untranslated sequence of the orphan receptor, 

HGPRBMY4 (SEQ ID NO:3). 

[0038] Figure 4 shows the 3 ' untranslated sequence of the orphan receptor, 

HGPRBMY4 (SEQ ID NO:4). 

[0039] Figure 5 shows the predicted transmembrane region of the 

HGPRBMY4 protein where the predicted transmembranes, bold-faced and underlined, 
correspond to the peaks with scores above 750. 

[0040] Figures 6A- 6B show the multiple sequence alignment of the translated 

sequence of the orphan G-protein coupled receptor, HGPRBMY4, where the GCG 
pileup program was used to generate the alignment with other G-protein coupled 
receptor sequences. The blackened areas represent identical amino acids in more than 
half of the listed sequences and the grey highlighted areas represent similar amino 
acids. As shown in Figures 6A- 6B, the sequences are aligned according to their 
amino acids, where: HGPRBMY4 (SEQ ID NO:2) is the translated full length 
HGPRBMY4 cDNA; Q9WVN4 (SEQ ID NO:8) represents the mouse form of MOR 
5' Betal; Q9WVN5 (SEQ ID NO:9) is the mouse form of MOR 5' Beta2; Q9Y5P1 
(SEQ ID NO: 10) is the human form of HOR 5' Beta3; Q9YH55 (SEQ ED NO: 11) is 
the chicken form of an olfactory receptor-like protein; 088628 (SEQ ID NO: 12) 
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represents the rat form of olfactory GPCR RA1C; Q9WU89 (SEQ ID NO: 13) is the 
mouse form of odorant receptor S18; Q9WVD9 (SEQ ID NO:14) is the mouse form of 
MOR 3' Beta 1; Q9WU93 (SEQ ID NO: 15) is the mouse form of odorant receptor 
S46; and Q9WVD7 (SEQ ID NO: 16) is the mouse form of MOR 3' Beta3. 
[0041] Figure 7 shows the expression profiling of the novel human orphan 

GPCR, HGPRBMY4, as described in Example 3. 

[0042] Figure 8 shows the expression profiling of the novel human orphan 

GPCR, HGPRBMY4, as described in Example 4 and Table 1. 

[0043] Figure 9 shows the FACS profile of an untransfected CHO NFAT-CRE 

cell line. 

[0044] Figure 10 shows that the overexpression of HGPRBMY4 constitutively 

couples through the NFAT/CRE response element. 

[0045] Figure 1 1 shows the FACS profile of an untransfected CHO NFAT-G 

alpha 15 cell line. 

[0046] Figure 12 shows that the overexpression of HGPRBMY4 constitutively 

couples through the NFAT response element via the promiscuous G protein, G alpha 
15. 

[0047] Figure 1 3 shows that expressed HGPRBMY4 localizes to the cell 

surface. 

[0048] Figure 14 shows that representative transfected CHO-NFAT/CRE cell 

lines with intermediate and high beta lactamase expression levels useful in screens to 
identify HGPRBMY4 agonists and/or antagonists. 

DETAILED DESCRIPTION OF THE INVENTION 

[0049] The present invention provides a novel isolated polynucleotide and 

encoded polypeptide, the expression of which is high in prostate-, colon-, lung-, and 
cardiovascular- related tissues. This novel polypeptide is termed herein HGPRBMY4, 
an acronym for "Human G-Protein coupled Receptor BMY4 ". HGPRBMY4 is also 
referred to as GPCR9. 
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Defmitions 

[0050] The HGPRBMY4 polypeptide (or protein) refers to the amino acid 

sequence of substantially purified HGPRBMY4, which may be obtained from any 
species, preferably mammalian, and more preferably, human, and from a variety of 
sources, including natural, synthetic, semi-synthetic, or recombinant. Functional 
fragments of the HGPRBMY4 polypeptide are also embraced by the present invention. 
[0051] An "agonist" refers to a molecule which, when bound to the 

HGPRBMY4 polypeptide, or a functional fragment thereof, increases or prolongs the 
duration of the effect of the HGPRBMY4 polypeptide. Agonists may include proteins, 
nucleic acids, carbohydrates, or any other molecules that bind to and modulate the 
effect of the HGPRBMY4 polypeptide. An antagonist refers to a molecule which, 
when bound to the HGPRBMY4 polypeptide, or a functional fragment thereof, 
decreases the amount or duration of the biological or immunological activity of the 
HGPRBMY4 polypeptide. "Antagonists" may include proteins, nucleic acids, 
carbohydrates, antibodies, or any other molecules that decrease or reduce the effect of 
the HGPRBMY4 polypeptide. 

[0052] "Nucleic acid sequence", as used herein, refers to an oligonucleotide, 

nucleotide, or polynucleotide, and fragments or portions thereof, and to DNA or RNA 
of genomic or synthetic origin which may be single- or double-stranded, and represent 
the sense or anti-sense strand. By way of non-limiting example, fragments include 
nucleic acid sequences that are greater than 20-60 nucleotides in length, and preferably 
include fragments that are at least 70-100 nucleotides, or which are at least 1000 
nucleotides or greater in length. 

[0053] Similarly, "amino acid sequence" as used herein refers to an 

oligopeptide, peptide, polypeptide, or protein sequence, and fragments or portions 
thereof, and to naturally occurring or synthetic molecules. Amino acid sequence 
fragments are typically from about 5 to about 30, preferably from about 5 to about 15 
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amino acids in length and retain the biological activity or function of the HGPRBMY4 
polypeptide. 

[0054] Where "amino acid sequence" is recited herein to refer to an amino acid 

sequence of a naturally occurring protein molecule, "amino acid sequence" and like 
terms, such as "polypeptide" or "protein" are not meant to limit the amino acid 
sequence to the complete, native amino acid sequence associated with the recited 
protein molecule. In addition, the terms HGPRBMY4 polypeptide and HGPRBMY4 
protein are used interchangeably herein to refer to the encoded product of the 
HGPRBMY4 nucleic acid sequence of the present invention. 
[0055] A "variant" of the HGPRBMY4 polypeptide refers to an amino acid 

sequence that is altered by one or more amino acids. The variant may have 
"conservative" changes, wherein a substituted amino acid has similar structural or 
chemical properties, e.g., replacement of leucine with isoleucine. More rarely, a 
variant may have "non-conservative" changes, e.g., replacement of a glycine with a 
tryptophan. Minor variations may also include amino acid deletions or insertions, or 
both. Guidance in determining which amino acid residues may be substituted, 
inserted, or deleted without abolishing functional biological or immunological activity 
may be found using computer programs well known in the art, for example, 
DNASTAR software. 

[0056] An "allele" or "allelic sequence" is an alternative form of the 

HGPRBMY4 nucleic acid sequence. Alleles may result from at least one mutation in 
the nucleic acid sequence and may yield altered mRNAs or polypeptides whose 
structure or function may or may not be altered. Any given gene, whether natural or 
recombinant, may have none, one, or many allelic forms. Common mutational 
changes, which give rise to alleles, are generally ascribed to natural deletions, 
additions, or substitutions of nucleotides. Each of these types of changes may occur 
alone, or in combination with the others, one or more times in a given sequence. 
[0057] "Altered" nucleic acid sequences encoding the HGPRBMY4 

polypeptide include nucleic acid sequences containing deletions, insertions and/ or 
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substitutions of different nucleotides resulting in a polynucleotide that encodes the 
same or a functionally equivalent HGPRBMY4 polypeptide. Altered nucleic acid 
sequences may further include polymorphisms of the polynucleotide encoding the 
HGPRBMY4 polypeptide; such polymorphisms may or may not be readily detectable 
using a particular oligonucleotide probe. The encoded protein may also contain 
deletions, insertions, or substitutions of amino acid residues, which produce a silent 
change and result in a functionally equivalent HGPRBMY4 protein. Deliberate amino 
acid substitutions may be made on the basis of similarity in polarity, charge, solubility, 
hydrophobicity, hydrophilicity, and/ or the amphipathic nature of the residues, as long 
as the biological activity of the HGPRBMY4 protein is retained. For example, 
negatively charged amino acids may include aspartic acid and glutamic acid; positively 
charged amino acids may include lysine and arginine; and amino acids with uncharged 
polar head groups having similar hydrophilicity values may include leucine, 
isoleucine, and valine; glycine and alanine; asparagine and glutamine; serine and 
threonine; and phenylalanine and tyrosine. 

[0058] "Peptide nucleic acid" (PNA) refers to an antisense molecule or anti- 

gene agent which comprises an oligonucleotide ("oligo") linked via an amide bond, 
similar to the peptide backbone of amino acid residues. PNAs typically comprise 
oligos of at least 5 nucleotides linked to amino acid residues. PNAs may or may not 
terminate in positively charged amino acid residues to enhance binding affinities to 
DNA. Such amino acids include, for example, lysine and arginine among others. 
These small molecules stop transcript elongation by binding to their complementary 
strand of nucleic acid (P.E. Nielsen et al., 1993, Anticancer Drug Des., 8:53-63). PNA 
may be pegylated to extend their lifespan in the cell where they preferentially bind to 
complementary single stranded DNA and RNA. 

[0059] "Oligonucleotides" or "oligomers" refer to a nucleic acid sequence, 

preferably comprising contiguous nucleotides, of at least about 6 nucleotides to about 
60 nucleotides, preferably at least about 8 to 10 nucleotides in length, more preferably 
at least about 12 nucleotides in length e.g., about 15 to 35 nucleotides, or about 15 to 
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25 nucleotides, or about 20 to 35 nucleotides, which can be typically used in PCR 
amplification assays, hybridization assays, or in microarrays. It will be understood 
that the term oligonucleotide is substantially equivalent to the terms primer, probe, or 
amplimer, as commonly defined in the art. It will also be appreciated by those skilled 
in the pertinent art that a longer oligonucleotide probe, or mixtures of probes, e.g., 
degenerate probes, can be used to detect longer, or more complex, nucleic acid 
sequences, for example, genomic DNA. In such cases, the probe may comprise at 
least 20-200 nucleotides, preferably, at least 30-100 nucleotides, more preferably, 50- 
100 nucleotides. 

[0060] "Amplification" refers to the production of additional copies of a 

nucleic acid sequence and is generally carried out using polymerase chain reaction 
(PCR) technologies, which are well known and practiced in the art (see, D.W. 
Dieffenbach and G.S. Dveksler, 1995, PCR Primer, a Laboratory Manual, Cold 
Spring Harbor Press, Plainview, NY). 

[0061] "Microarray" is an array of distinct polynucleotides or oligonucleotides 

synthesized on a substrate, such as paper, nylon, or other type of membrane; filter; 
chip; glass slide; or any other type of suitable solid support. 

[0062] The term "antisense" refers to nucleotide sequences, and compositions 

containing nucleic acid sequences, which are complementary to a specific DNA or 
RNA sequence. The term "antisense strand" is used in reference to a nucleic acid 
strand that is complementary to the "sense" strand. Antisense (i.e., complementary) 
nucleic acid molecules include PNA and may be produced by any method, including 
synthesis or transcription. Once introduced into a cell, the complementary nucleotides 
combine with natural sequences produced by the cell to form duplexes, which block 
either transcription or translation. The designation "negative" is sometimes used in 
reference to the antisense strand, and "positive" is sometimes used in reference to the 
sense strand. 

[0063] The term "consensus" refers to the sequence that reflects the most 

common choice of base or amino acid at each position among a series of related DNA, 
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RNA or protein sequences. Areas of particularly good agreement often represent 
conserved functional domains. 

[0064] A "deletion" refers to a change in either nucleotide or amino acid 

sequence and results in the absence of one or more nucleotides or amino acid residues. 
By contrast, an insertion (also termed "addition") refers to a change in a nucleotide or 
amino acid sequence that results in the addition of one or more nucleotides or amino 
acid residues, as compared with the naturally occurring molecule. A substitution 
refers to the replacement of one or more nucleotides or amino acids by different 
nucleotides or amino acids. 

[0065] A "derivative" nucleic acid molecule refers to the chemical 

modification of a nucleic acid encoding, or complementary to, the encoded 
HGPRBMY4 polypeptide. Such modifications include, for example, replacement of 
hydrogen by an alkyl, acyl, or amino group. A nucleic acid derivative encodes a 
polypeptide, which retains the essential biological and/ or functional characteristics of 
the natural molecule. A derivative polypeptide is one, which is modified by 
glycosylation, pegylation, or any similar process that retains the biological and/ or 
functional or immunological activity of the polypeptide from which it is derived. 
[0066] The term "biologically active", i.e., functional, refers to a protein or 

polypeptide or fragment thereof having structural, regulatory, or biochemical functions 
of a naturally occurring molecule. Likewise, "immunologically active" refers to the 
capability of the natural, recombinant, or synthetic HGPRBMY4, or any oligopeptide 
thereof, to induce a specific immune response in appropriate animals or cells, for 
example, to generate antibodies, and to bind with specific antibodies. 
[0067] The term "hybridization" refers to any process by which a strand of 

nucleic acid binds with a complementary strand through base pairing. 
[0068] The term "hybridization complex" refers to a complex formed between 

two nucleic acid sequences by virtue of the formation of hydrogen bonds between 
complementary G and C bases and between complementary A and T bases. The 
hydrogen bonds may be further stabilized by base stacking interactions. The two 

EXPRESS MAIL LABEL NO.: EL 912004928 US 

646785 vl 



-15- 



complementary nucleic acid sequences hydrogen bond in an anti-parallel 
configuration. A hybridization complex may be formed in solution (e.g., C 0 t or Rot 
analysis), or between one nucleic acid sequence present in solution and another nucleic 
acid sequence immobilized on a solid support (e.g., membranes, filters, chips, pins, or 
glass slides, or any other appropriate substrate to which cells or their nucleic acids 
have been affixed). 

[0069] The terms "stringency" or "stringent conditions" refer to the conditions 

for hybridization as defined by nucleic acid composition, salt and temperature. These 
conditions are well known in the art and may be altered to identify and/ or detect 
identical or related polynucleotide sequences in a sample. A variety of equivalent 
conditions comprising either low, moderate, or high stringency depend on factors such 
as the length and nature of the sequence (DNA, RNA, base composition), reaction 
milieu (in solution or immobilized on a solid substrate), nature of the target nucleic 
acid (DNA, RNA, base composition), concentration of salts and the presence or 
absence of other reaction components (e.g., formamide, dextran sulfate and/ or 
polyethylene glycol) and reaction temperature (within a range of from about 5°C 
below the melting temperature of the probe to about 20°C to 25°C below the melting 
temperature). One or more factors may be varied to generate conditions, either low or 
high stringency, that are different from but equivalent to the aforementioned 
conditions. 

[0070] As will be understood by those of skill in the art, the stringency of 

hybridization may be altered in order to identify or detect identical or related 
polynucleotide sequences. As will be further appreciated by the skilled practitioner, 
melting temperature, T m , can be approximated by the formulas as known in the art, 
depending on a number of parameters, such as the length of the hybrid or probe in 
number of nucleotides, or hybridization buffer ingredients and conditions (see, for 
example, T. Maniatis et al., Molecular Cloning: A Laboratory Manual . Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY, 1982 and J. Sambrook et al., Molecular 
Cloning: A Laboratory Manual . Cold Spring Harbor Laboratory, Cold Spring Harbor, 
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NY, 1989; Current Protocols in Molecular Biology . Eds. F.M. Ausubel et al., Vol. 1, 
"Preparation and Analysis of DNA", John Wiley and Sons, Inc., 1994-1995, Suppls. 
26, 29, 35 and 42; pp. 2.10.7- 2.10.16; G.M. Wahl and S. L. Berger (1987; Methods 
Enzvmol. 152:399-407); and A.R. Kimmel, 1987; Methods of Enzvmol. 152:507-511). 
As a general guide, T m decreases approximately 1°C -1.5°C with every 1% decrease in 
sequence homology. Also, in general, the stability of a hybrid is a function of sodium 
ion concentration and temperature. Typically, the hybridization reaction is initially 
performed under conditions of low stringency, followed by washes of varying, but 
higher stringency. Reference to hybridization stringency, e.g., high, moderate, or low 
stringency, typically relates to such washing conditions. 

[0071] Thus, by way of non-limiting example, "high stringency" refers to 

conditions that permit hybridization of those nucleic acid sequences that form stable 
hybrids in 0.018M NaCl at about 65°C (i.e., if a hybrid is not stable in 0.018M NaCl at 
about 65°C, it will not be stable under high stringency conditions). High stringency 
conditions can be provided, for instance, by hybridization in 50% formamide, 5x 
Denhardt's solution, 5xSSPE (saline sodium phosphate EDTA) (lx SSPE buffer 
comprises 0.15 M NaCl, 10 mM Na 2 HP0 4 , 1 mM EDTA), (or lx SSC buffer 
containing 150 mM NaCl, 15 mM Na 3 citrate • 2 H 2 0, pH 7.0), 0.2% SDS at about 
42°C, followed by washing in lx SSPE (or saline sodium citrate, SSC) and 0.1% SDS 
at a temperature of at least about 42°C, preferably about 55°C, more preferably about 
65°C. 

[0072] "Moderate stringency" refers, by non-limiting example, to conditions 

that permit hybridization in 50% formamide, 5x Denhardt's solution, 5xSSPE (or 
SSC), 0.2% SDS at 42°C (to about 50°C), followed by washing in 0.2x SSPE (or SSC) 
and 0.2% SDS at a temperature of at least about 42°C, preferably about 55°C, more 
preferably about 65°C. 

[0073] "Low stringency" refers, by non-limiting example, to conditions that 

permit hybridization in 10% formamide, 5x Denhardt's solution, 6xSSPE (or SSC), 
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0.2% SDS at 42°C, followed by washing in lx SSPE (or SSC) and 0.2% SDS at a 
temperature of about 45°C, preferably about 50°C. 

[0074] For additional stringency conditions, see T. Maniatis et al., Molecular 

Cloning: A Laboratory Manual Cold Spring Harbor Laboratory, Cold Spring Harbor, 
NY (1982). It is to be understood that the low, moderate and high stringency 
hybridization / washing conditions may be varied using a variety of ingredients, 
buffers and temperatures well known to and practiced by the skilled artisan. 
[0075] The terms "complementary" or "complementarity" refer to the natural 

binding of polynucleotides under permissive salt and temperature conditions by base- 
pairing. For example, the sequence "A-G-T" binds to the complementary sequence 
"T-C-A". Complementarity between two single-stranded molecules may be "partial", 
in which only some of the nucleic acids bind, or it may be complete when total 
complementarity exists between single stranded molecules. The degree of 
complementarity between nucleic acid strands has significant effects on the efficiency 
and strength of hybridization between nucleic acid strands. This is of particular 
importance in amplification reactions, which depend upon binding between nucleic 
acids strands, as well as in the design and use of PNA molecules. 
[0076] The term "homology" refers to a degree of complementarity. There 

may be partial homology or complete homology, wherein complete homology is 
equivalent to identity. A partially complementary sequence that at least partially 
inhibits an identical sequence from hybridizing to a target nucleic acid is referred to 
using the functional term "substantially homologous". The inhibition of hybridization 
of the completely complementary sequence to the target sequence may be examined 
using a hybridization assay (e.g., Southern or Northern blot, solution hybridization and 
the like) under conditions of low stringency. A substantially homologous sequence or 
probe will compete for and inhibit the binding (i.e., the hybridization) of a completely 
homologous sequence or probe to the target sequence under conditions of low 
stringency. Nonetheless, conditions of low stringency do not permit non-specific 
binding; low stringency conditions require that the binding of two sequences to one 
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another be a specific (i.e., selective) interaction. The absence of non-specific binding 
may be tested by the use of a second target sequence which lacks even a partial degree 
of complementarity (e.g., less than about 30% identity). In the absence of non-specific 
binding, the probe will not hybridize to the second non-complementary target 
sequence. 

[0077] Those having skill in the art will know how to determine percent 

identity between/among sequences using, for example, algorithms such as those based 
on the CLUSTALW computer program (J.D. Thompson et al., 1994, Nucleic Acids 
Research, 2(22):4673-4680), or FASTDB, (Brutlag et al., 1990, Comp. App. Biosci., 
6:237-245), as known in the art. Although the FASTDB algorithm typically does not 
consider internal non-matching deletions or additions in sequences, i.e., gaps, in its 
calculation, this can be corrected manually to avoid an overestiniation of the % 
identity. CLUSTALW, however, does take sequence gaps into account in its identity 
calculations. 

[0078] A "composition comprising a given polynucleotide sequence" refers 

broadly to any composition containing the given polynucleotide sequence. The 
composition may comprise a dry formulation or an aqueous solution. Compositions 
comprising polynucleotide sequence (SEQ ID NO:l) encoding the HGPRBMY4 
polypeptide (SEQ ID NO:2), or fragments thereof, may be employed as hybridization 
probes. The probes may be stored in freeze-dried form and may be in association with 
a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be 
employed in an aqueous solution containing salts (e.g., NaCl), detergents or surfactants 
(e.g., SDS) and other components (e.g., Denhardt's solution, dry milk, salmon sperm 
DNA, and the like). 

[0079] The term "substantially purified" refers to nucleic acid sequences or 

amino acid sequences that are removed from their natural environment, isolated or 
separated, and are at least 60% free, preferably 75% to 85% free, and most preferably 
90% or greater free from other components with which they are naturally associated. 
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[0080] The term "sample", or "biological sample", is meant to be interpreted in 

its broadest sense. A biological sample suspected of containing nucleic acids encoding 
the HGPRBMY4 protein, or fragments thereof, or HGPRBMY4 protein itself, may 
comprise a body fluid, an extract from cells or tissue, chromosomes isolated from a 
cell (e.g., a spread of metaphase chromosomes), organelle, or membrane isolated from 
a cell, a cell, nucleic acid such as genomic DNA (in solution or bound to a solid 
support such as for Southern analysis), RNA (in solution or bound to a solid support 
such as for Northern analysis), cDNA (in solution or bound to a solid support), a 
tissue, a tissue print and the like. 

[0081] "Transformation" refers to a process by which exogenous DNA enters 

and changes a recipient cell. It may occur under natural or artificial conditions using 
various methods well known in the art. Transformation may rely on any known 
method for the insertion of foreign nucleic acid sequences into a prokaryotic or 
eukaryotic host cell. The method is selected based on the type of host cell being 
transformed and may include, but is not limited to, viral infection, electroporation, heat 
shock, lipofection, and partial bombardment. Such "transformed" cells include stably 
transformed cells in which the inserted DNA is capable of replication either as an 
autonomously replicating plasmid or as part of the host chromosome. Transformed 
cells also include those cells, which transiently express the inserted DNA or RNA for 
limited periods of time. 

[0082] The term "mimetic" refers to a molecule, the structure of which is 

developed from knowledge of the structure of the HGPRBMY4 protein, or portions 
thereof, and as such, is able to effect some or all of the actions of the HGPRBMY4 
protein. 

[0083] The term "portion" with regard to a protein (as in "a portion of a given 

protein") refers to fragments or segments of that protein. The fragments may range in 
size from four or five amino acid residues to the entire amino acid sequence minus one 
amino acid. Thus, a protein "comprising at least a portion of the amino acid sequence 
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of SEQ ID NO: 2" encompasses the full-length human HGPRBMY4 polypeptide, and 
fragments thereof. 

[0084] The term "antibody" refers to intact molecules as well as fragments 

thereof, such as Fab, F(ab')2, Fv, which are capable of binding an epitopic or antigenic 
determinant. Antibodies that bind to HGPRBMY4 polypeptides can be prepared using 
intact polypeptides or fragments containing small peptides of interest or prepared 
recombinantly for use as the immunizing antigen. The polypeptide or oligopeptide 
used to immunize an animal can be derived from the transition of RNA or synthesized 
chemically, and can be conjugated to a carrier protein, if desired. Commonly used 
carriers that are chemically coupled to peptides include, but are not limited to, bovine 
serum albumin (BSA), keyhole limpet hemocyanin (KLH), and thyroglobulin. The 
coupled peptide is then used to immunize the animal (e.g, a mouse, a rat, or a rabbit). 
[0085] The term "humanized" antibody refers to antibody molecules in which 

amino acids have been replaced in the non-antigen binding regions in order to more 
closely resemble a human antibody, while still retaining the original binding 
capability, e.g., as described in U.S. Patent No. 5,585,089 to C.L. Queen et al. 
[0086] The term "antigenic determinant" refers to that portion of a molecule 

that makes contact with a particular antibody (i.e., an epitope). When a protein or 
fragment of a protein is used to immunize a host animal, numerous regions of the 
protein may induce the production of antibodies which bind specifically to a given 
region or three-dimensional structure on the protein; these regions or structures are 
referred to an antigenic determinants. An antigenic determinant may compete with the 
intact antigen (i.e., the immunogen used to elicit the immune response) for binding to 
an antibody. 

[0087] The terms "specific binding" or "specifically binding" refer to the 

interaction between a protein or peptide and a binding molecule, such as an agonist, an 
antagonist, or an antibody. The interaction is dependent upon the presence of a 
particular structure (i.e., an antigenic determinant or epitope) of the protein that is 
recognized by the binding molecule. For example, if an antibody is specific for 
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epitope "A", the presence of a protein containing epitope A (or free, unlabeled A) in a 
reaction containing labeled "A" and the antibody will reduce the amount of labeled A 
bound to the antibody. 

[0088] The term "correlates with expression of a polynucleotide" indicates that 

the detection of the presence of ribonucleic acid that is similar to SEQ ID NO:l by 
Northern analysis is indicative of the presence of mRNA encoding the HGPRBMY4 
polypeptide (SEQ ID NO:2) in a sample and thereby correlates with expression of the 
transcript from the polynucleotide encoding the protein. 

[0089] An "alteration" in the polynucleotide of SEQ ID NO: 1 comprises any 

alteration in the sequence of the polynucleotides encoding the HGPRBMY4 
polypeptide (SEQ ID NO:2), including deletions, insertions, and point mutations that 
may be detected using hybridization assays. Included within this definition is the 
detection of alterations to the genomic DNA sequence which encodes the 
HGPRBMY4 polypeptide (SEQ ID NO:2; e.g., by alterations in the pattern of 
restriction fragment length polymorphisms capable of hybridizing to SEQ ID NO:2), 
the inability of a selected fragment of the polypeptide of SEQ ID NO:2 to hybridize to 
a sample of genomic DNA (e.g., using allele-specific oligonucleotide probes), and 
improper or unexpected hybridization, such as hybridization to a locus other than the 
normal chromosomal locus for the polynucleotide sequence encoding the 
HGPRBMY4 polypeptide (e.g., using fluorescent in situ hybridization (FISH) to 
metaphase chromosome spreads). 

Description of the Invention 

[0090] The present invention provides a novel human member of the G-protein 

coupled receptor (GPCR) family (HGPRBMY4). Based on sequence homology, the 
protein HGPRBMY4 is a novel human GPCR. This protein sequence has been 
predicted to contain seven transmembrane domains, which is a characteristic structural 
feature of GPCRs. This orphan GPCR is expressed highly in prostate, colon, lung, and 
moderately in the heart. HGPRBMY4 polypeptides and polynucleotides are useful for 
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diagnosing diseases related to over- and under- expression of HGPRBMY4 proteins by 
identifying mutations in the HGPRBMY4 gene using HGPRBMY4 probes, or 
determining HGPRBMY4 protein or mRNA expression levels. HGPRBMY4 
polypeptides are also useful for screening compounds, which affect activity of the 
protein. The invention encompasses the polynucleotide encoding the HGPRBMY4 
polypeptide and the use of the HGPRBMY4 polynucleotide or polypeptide, or 
compositions in thereof, the screening, diagnosis, treatment, or prevention of disorders 
associated with aberrant or uncontrolled cellular growth and/ or function, such as 
neoplastic diseases (e.g., cancers and tumors), with particular regard to those diseases 
or disorders related to the prostate, colon, lung or heart. More specifically, diseases 
that can be treated with HGPRBMY4 include Benign Prostate Hyperplasia, acute heart 
failure, hypotension, hypertension, angina pectoris, myocardial infarction, psychotic, 
immune, metabolic, neurological, cardiovascular and other prostate disorders, in 
addition to, colon and lung diseases, such as, but not limited to, Crohn's disease, 
Hirschsprung's disease, colonic carcinoma, inflammatory bowel disease, Chagas' 
disease, bronchopulmonary dysplasia, post-inflammatory pseudotumor, and Pancoast's 
syndrome. 

[0091] HGPRBMY4 polypeptides are also useful for screening for compounds, 

which affect activity of the protein. Nucleic acids, encoding the HGPRBMY4 protein 
according to the present invention, were first identified, in Incyte CloneID:998550 
from a kidney tumor tissue library, through a computer search for amino acid sequence 
alignments (see Example 1). 

[0092] In one of its embodiments, the present invention encompasses a 

polypeptide comprising the amino acid sequence of SEQ ID NO:2 as shown in Figure 
1 . The HGPRBMY4 polypeptide is 3 1 8 amino acids in length and shares amino acid 
sequence homology the putative G-protein coupled receptor, RA1C. The 
HGPRBMY4 polypeptide shares 60 % identity and 77 % similarity with 299 amino 
acids of the putative G-protein coupled receptor RA1C, wherein "similar" amino acids 
are those which have the same/ similar physical properties and in many cases, the 
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function is conserved with similar residues. For example, amino acids Lysine and 
Arginine are similar. Residues such as Proline and Cysteine do not share any physical 
property and they are not considered similar. The HGPRBMY4 polypeptide shares 
58.3% identity and 66.9% similarity with the rattus norvegicus putative G-protein 
coupled receptor RA1C (Acc. No.:088628); 47% identity and 57.8% similarity with 
the mus musculus odorant receptor S18 (Acc. No.:Q9WU89); 43.8% identity and 
55.6% similarity with the mus musculus odorant receptor S46 (Acc. No.:Q9WU93); 
47.3% identity and 57.8% similarity with the mus musculus MOR 3 'BET A3 (Acc. 
No.:Q9WVD7); 47.5% identity and 62% similarity with the mus musculus MOR 
3' BETA 1 (Acc. No.:Q9WVD9); 44.4% identity and 56.9% similarity with the mus 
musculus MOR 5'BETA1 (Acc. No.:Q9WVN4); 47% identity and 60.5% similarity 
with mus musculus MOR 5'BETA2 (Acc. No.:Q9WVN5); 43.1% identity and 57.2% 
similarity with human HOR 5 'BET A3 (Acc. No.:Q9Y5Pl); and 50% identity and 
62.2% similarity with the gallus gallus olfactory receptor-like protein COR3'BETA 
(Acc. No.:Q9YH55). 

[0093] Variants of the HGPRBMY4 polypeptide are also encompassed by the 

present invention. A preferred HGPRBMY4 variant has at least 75 to 80%, more 
preferably at least 85 to 90%, and even more preferably at least 90% amino acid 
sequence identity to the amino acid sequence claimed herein, and which retains at least 
one biological, immunological, or other functional characteristic or activity of 
HGPRBMY4 polypeptide. Most preferred is a variant having at least 95% amino acid 
sequence identity to that of SEQ ID NO:2. 

[0094] In another embodiment, the present invention encompasses 

polynucleotides, which encode the HGPRBMY4 polypeptide. Accordingly, any 
nucleic acid sequence, which encodes the amino acid sequence of the HGPRBMY4 
polypeptide, can be used to produce recombinant molecules that express the 
HGPRBMY4 protein. In a particular embodiment, the present invention encompasses 
the HGPRBMY4 polynucleotide comprising the nucleic acid sequence of SEQ ID 
NO: 1 and as shown in Figure 1 . More particularly, the present invention provides the 
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HGPRBMY4 clone, deposited at the American Type Culture Collection (ATCC), 
10801 University Boulevard, Manassas, VA 20110-2209 on November 15, 2000 and 
under ATCC Accession No. PTA-2682 according to the terms of the Budapest Treaty. 
[0095] As will be appreciated by the skilled practitioner in the art, the 

degeneracy of the genetic code results in the production of a multitude of nucleotide 
sequences encoding the HGPRBMY4 polypeptide. Some of the sequences bear 
minimal homology to the nucleotide sequences of any known and naturally occurring 
gene. Accordingly, the present invention contemplates each and every possible 
variation of nucleotide sequence that could be made by selecting combinations based 
on possible codon choices. These combinations are made in accordance with the 
standard triplet genetic code as applied to the nucleotide sequence of naturally 
occurring HGPRBMY4, and all such variations are to be considered as being 
specifically disclosed. 

[0096] Although nucleotide sequences which encode the HGPRBMY4 

polypeptide and its variants are preferably capable of hybridizing to the nucleotide 
sequence of the naturally occurring HGPRBMY4 polypeptide under appropriately 
selected conditions of stringency, it may be advantageous to produce nucleotide 
sequences encoding the HGPRBMY4 polypeptide, or its derivatives, which possess a 
substantially different codon usage. Codons may be selected to increase the rate at 
which expression of the peptide/polypeptide occurs in a particular prokaryotic or 
eukaryotic host in accordance with the frequency with which particular codons are 
utilized by the host. Other reasons for substantially altering the nucleotide sequence 
encoding the HGPRBMY4 polypeptide, and its derivatives, without altering the 
encoded amino acid sequences include the production of RNA transcripts having more 
desirable properties, such as a greater half-life, than transcripts produced from the 
naturally occurring sequence. 

[0097] The present invention also encompasses production of DNA sequences, 

or portions thereof, which encode the HGPRBMY4 polypeptide, and its derivatives, 
entirely by synthetic chemistry. After production, the synthetic sequence may be 
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inserted into any of the many available expression vectors and cell systems using 
reagents that are well known and practiced by those in the art. Moreover, synthetic 
chemistry may be used to introduce mutations into a sequence encoding the 
HGPRBMY4 polypeptide, or any fragment thereof. 

[0098] Also encompassed by the present invention are polynucleotide 

sequences that are capable of hybridizing to the claimed nucleotide sequence of 
HGPRBMY4, such as that shown in SEQ ID NO:l, under various conditions of 
stringency. Hybridization conditions are typically based on the melting temperature 
(T m ) of the nucleic acid binding complex or probe (see, G.M. Wahl and S.L. Berger, 
1987; Methods Enzymol, 152:399-407 and A.R. Kimmel, 1987; Methods ofEnzymol, 
152:507-51 1), and may be used at a defined stringency. For example, included in the 
present invention are sequences capable of hybridizing under moderately stringent 
conditions to the HGPRBMY4 polypeptide sequence of SEQ ID NO:2 and other 
sequences which are degenerate to those which encode HGPRBMY4 polypeptide (e.g., 
as a non-limiting example: prewashing solution of 2X SSC, 0.5% SDS, l.OmM EDTA, 
pH 8.0, and hybridization conditions of 50°C, 5XSSC, overnight. 
[0099] The nucleic acid sequence encoding the HGPRBMY4 protein may be 

extended utilizing a partial nucleotide sequence and employing various methods 
known in the art to detect upstream sequences such as promoters and regulatory 
elements. For example, one method, which may be employed, is restriction-site PCR, 
which utilizes universal primers to retrieve unknown sequence adjacent to a known 
locus (G. Sarkar, 1993, PCR Methods Applic, 2:318-322). In particular, genomic 
DNA is first amplified in the presence of primer to a linker sequence and a primer 
specific to the known region. The amplified sequences are then subjected to a second 
round of PCR with the same linker primer and another specific primer internal to the 
first one. Products of each round of PCR are transcribed with an appropriate RNA 
polymerase and sequenced using reverse transcriptase. 

[00100] Inverse PCR may also be used to amplify or extend sequences using 

divergent primers based on a known region or sequence (T. Triglia et al., 1988, 

EXPRESS MAIL LABEL NO.: EL 912004928 US 

646785 vl 



-26- 



Nucleic Acids Res., 16:8186). The primers may be designed using OLIGO 4.06 Primer 
Analysis software (National Biosciences Inc., Plymouth, MN.), or another appropriate 
program, to be 22-30 nucleotides in length, to have a GC content of 50% or more, and 
to anneal to the target sequence at temperatures about 68°-72°C. The method uses 
several restriction enzymes to generate a suitable fragment in the known region of a 
gene. The fragment is then circularized by intramolecular ligation and used as a PCR 
template. 

[00101] Another method which may be used is capture PCR which involves 

PCR amplification of DNA fragments adjacent to a known sequence in human and 
yeast artificial chromosome (YAC) DNA (M. Lagerstrom et al., 1991, PCR Methods 
Applic, 1:111-119). In this method, multiple restriction enzyme digestions and 
ligations may also be used to place an engineered double-stranded sequence into an 
unknown portion of the DNA molecule before performing PCR. J.D. Parker et al. 
(1991; Nucleic Acids Res., 19:3055-3060) provide another method which may be used 
to retrieve unknown sequences. In addition, PCR, nested primers, and 
PROMOTERFINDER libraries can be used to walk genomic DNA (Clontech, Palo 
Alto, CA). This process avoids the need to screen libraries and is useful in finding 
intron/exon junctions. 

[00102] When screening for full-length cDNAs, it is preferable to use libraries 

that have been size-selected to include larger cDNAs. Also, random-primed libraries 
are preferable, since they will contain more sequences, which contain the 5' regions of 
genes. The use of a randomly primed library may be especially preferable for 
situations in which an oligo d(T) library does not yield a full-length cDNA. Genomic 
libraries may be useful for extension of sequence into the 5' and 3' non-transcribed 
regulatory regions. 

[00103] The embodiments of the present invention can be practiced using 

methods for DNA sequencing which are well known and generally available in the art. 
The methods may employ such enzymes as the Klenow fragment of DNA polymerase 
I, SEQUENASE (US Biochemical Corp. Cleveland, OH), Taq polymerase (PE 
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Biosystems), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway, 
NJ), or combinations of recombinant polymerases and proofreading exonucleases such 
as the ELONGASE Amplification System marketed by Life Technologies 
(Gaithersburg, MD). Preferably, the process is automated with machines such as the 
Hamilton Micro Lab 2200 (Hamilton, Reno, NV), Peltier Thermal Cycler (PTC200; 
MJ Research, Watertown, MA) and the ABI Catalyst and 373 and 377 DNA 
sequencers (PE Biosystems). 

[0100] Commercially available capillary electrophoresis systems may be used 

to analyze the size or confirm the nucleotide sequence of sequencing or PCR products. 
In particular, capillary sequencing may employ flowable polymers for electrophoretic 
separation, four different fluorescent dyes (one for each nucleotide) which are laser 
activated, and detection of the emitted wavelengths by a charge coupled device 
camera. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, PE Biosystems) and 
the entire process — from loading of samples to computer analysis and electronic data 
display — may be computer controlled. Capillary electrophoresis is especially 
preferable for the sequencing of small pieces of DNA, which may be present in limited 
amounts in a particular sample. 

[0101] In another embodiment of the present invention, polynucleotide 

sequences or fragments thereof which encode the HGPRBMY4 polypeptide, or 
peptides thereof, may be used in recombinant DNA molecules to direct the expression 
of the HGPRBMY4 polypeptide product, or fragments or functional equivalents 
thereof, in appropriate host cells. Because of the inherent degeneracy of the genetic 
code, other DNA sequences, which encode substantially the same or a functionally 
equivalent amino acid sequence, may be produced and these sequences may be used to 
clone and express the HGPRBMY4 protein. 

[0102] As will be appreciated by those having skill in the art, it may be 

advantageous to produce HGPRBMY4 polypeptide-encoding nucleotide sequences 
possessing non-naturally occurring codons. For example, codons preferred by a 
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particular prokaryotic or eukaryotic host can be selected to increase the rate of protein 
expression or to produce a recombinant RNA transcript having desirable properties, 
such as a half-life which is longer than that of a transcript generated from the naturally 
occurring sequence. 

[0103] The nucleotide sequence of the present invention can be engineered 

using methods generally known in the art in order to alter HGPRBMY4 polypeptide- 
encoding sequences for a variety of reasons, including, but not limited to, alterations 
which modify the cloning, processing, and/ or expression of the gene product. DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and 
synthetic oligonucleotides may be used to engineer the nucleotide sequences. For 
example, site-directed mutagenesis may be used to insert new restriction sites, alter 
glycosylation patterns, change codon preference, produce splice variants, or introduce 
mutations, and the like. 

[01 04] In preferred embodiments, the present invention encompasses a 

polynucleotide lacking the initiation start codon, in addition to, the resulting encoded 
polypeptide of HGPRBMY4. Specifically, the present invention encompasses the 
polynucleotide corresponding to nulceotides 4 through 954 of SEQ ID NO: 1, and the 
polypeptide corresponding to amino acids 2 through 318 of SEQ ID NO:2. Also 
encompassed are recombinant vectors comprising said encoding sequence, and host 
cells comprising said vector. 

[0105] In another embodiment of the present invention, natural, modified, or 

recombinant nucleic acid sequences encoding the HGPRBMY4 polypeptide may be 
ligated to a heterologous sequence to encode a fusion protein. For example, for 
screening peptide libraries for inhibitors of HGPRBMY4 activity, it may be useful to 
encode a chimeric HGPRBMY4 protein that can be recognized by a commercially 
available antibody. A fusion protein may also be engineered to contain a cleavage site 
located between the HGPRBMY4 protein- encoding sequence and the heterologous 
protein sequence, so that HGPRBMY4 protein may be cleaved and purified away from 
the heterologous moiety. 
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[0106] In another embodiment, sequences encoding HGPRBMY4 polypeptide 

may be synthesized in whole, or in part, using chemical methods well known in the art 
(see, for example, M.H. Caruthers et al., 1980, Nucl. Acids Res. Symp. Ser., 215-223 
and T. Horn et al., 1980, Nucl. Acids Res. Symp. Ser., 225-232). Alternatively, the 
protein itself may be produced using chemical methods to synthesize the amino acid 
sequence of HGPRBMY4 polypeptide, or a fragment or portion thereof. For example, 
peptide synthesis can be performed using various solid-phase techniques (J.Y. Roberge 
et al., 1995, Science, 269:202-204) and automated synthesis maybe achieved, for 
example, using the ABI 431 A Peptide Synthesizer (PE Biosystems). 
[0107] The newly synthesized peptide can be substantially purified by 

preparative high performance liquid chromatography (e.g., T. Creighton, 1983, 
Proteins, Structures and Molecular Principles, W.H. Freeman and Co., New York, 
NY), by reversed-phase high performance liquid chromatography, or other purification 
methods as are known in the art. The composition of the synthetic peptides may be 
confirmed by amino acid analysis or sequencing (e.g., the Edman degradation 
procedure; Creighton, supra). In addition, the amino acid sequence of HGPRBMY4 
polypeptide or any portion thereof, may be altered during direct synthesis and/ or 
combined using chemical methods with sequences from other proteins, or any part 
thereof, to produce a variant polypeptide. 

[0108] To express a biologically active HGPRBMY4 polypeptide or peptide, 

the nucleotide sequences encoding HGPRBMY4 polypeptide, or functional 
equivalents, may be inserted into an appropriate expression vector, i.e., a vector, which 
contains the necessary elements for the transcription and translation of the inserted 
coding sequence. 

[01 09] Methods, which are well known to those skilled in the art, may be used 

to construct expression vectors containing sequences encoding HGPRBMY4 
polypeptide and appropriate transcriptional and translational control elements. These 
methods include in vitro recombinant DNA techniques, synthetic techniques, and in 
vivo genetic recombination. Such techniques are described in J. Sambrook et al., 1989, 
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Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, Plainview, N.Y. 
and in F.M. Ausubel et al., 1989, Current Protocols in Molecular Biology, John Wiley 
& Sons, New York, NY. 

[01 10] A variety of expression vector/host systems may be utilized to contain 

and express sequences encoding HGPRBMY4 polypeptide. Such expression 
vector/host systems include, but are not limited to, microorganisms such as bacteria 
transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression 
vectors; yeast transformed with yeast expression vectors; insect cell systems infected 
with virus expression vectors (e.g., bacculovirus); plant cell systems transformed with 
virus expression vectors (e.g., cauliflower mosaic virus (CaMV) and tobacco mosaic 
virus (TMV)), or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 
animal cell systems. The host cell employed is not limiting to the present invention. 
[0111] "Control elements" or "regulatory sequences" are those non-translated 

regions of the vector, e.g., enhancers, promoters, 5' and 3' untranslated regions, which 
interact with host cellular proteins to carry out transcription and translation. Such 
elements may vary in their strength and specificity. Depending on the vector system 
and host utilized, any number of suitable transcription and translation elements, 
including constitutive and inducible promoters, may be used. For example, when 
cloning in bacterial systems, inducible promoters such as the hybrid lacZ promoter of 
the BLUESCRIPT phagemid (Stratagene, La Jolla, CA) or PSPORT1 plasmid (Life 
Technologies), and the like, may be used. The baculovirus polyhedrin promoter may 
be used in insect cells. Promoters or enhancers derived from the genomes of plant 
cells (e.g., heat shock, RUBISCO; and storage protein genes), or from plant viruses 
(e.g., viral promoters or leader sequences), may be cloned into the vector. In 
mammalian cell systems, promoters from mammalian genes or from mammalian 
viruses are preferred. If it is necessary to generate a cell line that contains multiple 
copies of the sequence encoding HGPRBMY4, vectors based on SV40 or EBV may be 
used with an appropriate selectable marker. 

[0112] In bacterial systems, a number of expression vectors may be selected, 
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depending upon the use intended for the expressed HGPRBMY4 product. For 
example, when large quantities of expressed protein are needed for the induction of 
antibodies, vectors, which direct high level expression of fusion proteins that are 
readily purified, may be used. Such vectors include, but are not limited to, the 
multifunctional E. coli cloning and expression vectors such as BLUESCRIPT 
(Stratagene), in which the sequence encoding HGPRBMY4 polypeptide may be 
ligated into the vector in-frame with sequences for the amino -terminal Met and the 
subsequent 7 residues of 6-galactosidase, so that a hybrid protein is produced; pIN 
vectors (see, G. Van Heeke and S.M. Schuster, 1989, J. Biol. Chem., 264:5503-5509); 
and the like. pGEX vectors (Promega, Madison, WI) may also be used to express 
foreign polypeptides, as fusion proteins with glutathione S-transferase (GST). In 
general, such fusion proteins are soluble and can be easily purified from lysed cells by 
adsorption to glutathione-agarose beads followed by elution in the presence of free 
glutathione. Proteins made in such systems may be designed to include heparin, 
thrombin, or factor XA protease cleavage sites so that the cloned polypeptide of 
interest can be released from the GST moiety at will. 

[0113] In the yeast, Saccharomyces cerevisiae, a number of vectors containing 

constitutive or inducible promoters such as alpha factor, alcohol oxidase, and PGH 
may be used. (For reviews, see F.M. Ausubel et al., supra, and Grant et al., 1987, 
Methods Enzymol, 153:516-544). 

[0114] Should plant expression vectors be desired and used, the expression of 

sequences encoding HGPRBMY4 polypeptide may be driven by any of a number of 
promoters. For example, viral promoters such as the 35S and 19S promoters of CaMV 
may be used alone or in combination with the omega leader sequence from TMV (N. 
Takamatsu, 1987, EMBO J. , 6:307-3 1 1). Alternatively, plant promoters such as the 
small subunit of RUBISCO, or heat shock promoters, may be used (G. Coruzzi et al., 
1984, EMBO J., 3:1671-1680; R. Broglie et al., 1984, Science, 224:838-843; and J. 
Winter etal, 1991, Results Pr -obi. Cell Differ. 17:85-105). These constructs can be 
introduced into plant cells by direct DNA transformation or pathogen-mediated 
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transfection. Such techniques are described in a number of generally available reviews 
(see, for example, S. Hobbs or L.E. Murry, In: McGraw Hill Yearbook of Science and 
Technology (1992) McGraw Hill, New York, N.Y.; pp. 191-196). 
[0115] An insect system may also be used to express HGPRBMY4 

polypeptide. For example, in one such system, Autographa californica nuclear 
polyhedrosis virus (AcNPV) is used as a vector to express foreign genes in Spodoptera 
frugiperda cells or in Trichoplusia larvae. The sequences encoding HGPRBMY4 
polypeptide may be cloned into a non-essential region of the virus such as the 
polyhedrin gene and placed under control of the polyhedrin promoter. Successful 
insertion of HGPRBMY4 polypeptide will render the polyhedrin gene inactive and 
produce recombinant virus lacking coat protein. The recombinant viruses may then be 
used to infect, for example, S. frugiperda cells or Trichoplusia larvae in which the 
HGPRBMY4 polypeptide product may be expressed (E.K. Engelhard et al., 1994, 
Proc. Nat. Acad. Set, 91:3224-3227). 

[01 16] hi mammalian host cells, a number of viral-based expression systems 

may be utilized. In cases where an adenovirus is used as an expression vector, 
sequences encoding HGPRBMY4 polypeptide may be ligated into an adenovirus 
transcription/ translation complex containing the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be 
used to obtain a viable virus which is capable of expressing HGPRBMY4 polypeptide 
in infected host cells (J. Logan and T. Shenk, 1984, Proc. Natl. Acad. Sci., 81:3655- 
3659). In addition, transcription enhancers, such as the Rous sarcoma virus (RSV) 
enhancer, may be used to increase expression in mammalian host cells. 
[0117] Specific initiation signals may also be used to achieve more efficient 

translation of sequences encoding HGPRBMY4 polypeptide. Such signals include the 
ATG initiation codon and adjacent sequences. In cases where sequences encoding 
HGPRBMY4 polypeptide, its initiation codon, and upstream sequences are inserted 
into the appropriate expression vector, no additional transcriptional or translational 
control signals may be needed. However, in cases where only coding sequence, or a 
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fragment thereof, is inserted, exogenous translational control signals, including the 
ATG initiation codon, should be provided. Furthermore, the initiation codon should be 
in the correct reading frame to ensure translation of the entire insert. Exogenous 
translational elements and initiation codons may be of various origins, both natural and 
synthetic. The efficiency of expression may be enhanced by the inclusion of 
enhancers which are appropriate for the particular cell system that is used, such as 
those described in the literature (D. Scharf et al., 1994, Results Probl. Cell Differ., 
20:125-162). 

[0118] Moreover, a host cell strain may be chosen for its ability to modulate 

the expression of the inserted sequences or to process the expressed protein in the 
desired fashion. Such modifications of the polypeptide include, but are not limited to, 
acetylation, carboxylation, glycosylation, phosphorylation, lipidation, and acylation. 
Post-translational processing which cleaves a "prepro" form of the protein may also be 
used to facilitate correct insertion, folding and/ or function. Different host cells having 
specific cellular machinery and characteristic mechanisms for such post-translational 
activities (e.g., CHO, HeLa, MDCK, HEK293, and W138) are available from the 
American Type Culture Collection (ATCC), American Type Culture Collection 
(ATCC), 10801 University Boulevard, Manassas, VA 201 10-2209, and may be 
chosen to ensure the correct modification and processing of the foreign protein. 
[01 19] For long-term, high-yield production of recombinant proteins, stable 

expression is preferred. For example, cell lines which stably express HGPRBMY4 
protein may be transformed using expression vectors which may contain viral origins 
of replication and/ or endogenous expression elements and a selectable marker gene on 
the same, or on a separate, vector. Following the introduction of the vector, cells may 
be allowed to grow for 1-2 days in an enriched cell culture medium before they are 
switched to selective medium. The purpose of the selectable marker is to confer 
resistance to selection, and its presence allows the growth and recovery of cells, which 
successfully express the introduced sequences. Resistant clones of stably transformed 
cells may be proliferated using tissue culture techniques appropriate to the cell type. 
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[0120] Any number of selection systems may be used to recover transformed 

cell lines. These include, but are not limited to, the Herpes Simplex Virus thymidine 
kinase (HSV TK), (M. Wigler et al., 1977, Cell, 11:223-32) and adenine 
phosphoribosyltransferase (I. Lowy et al, 1980, Cell, 22:817-23) genes which can be 
employed in tk" or aprf cells, respectively. Also, anti-metabolite, antibiotic or 
herbicide resistance can be used as the basis for selection; for example, dhfr, which 
confers resistance to methotrexate (M. Wigler et al., 1980, Proc. Natl. Acad. Sci., 
77:3567-70); npt, which confers resistance to the aminoglycosides neomycin and G- 
418 (F. Colbere-Garapin et al., 1981, J. Mol. Biol, 150:1-14); and als or pat, which 
confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively 
(Murry, supra). Additional selectable genes have been described, for example, trpB, 
which allows cells to utilize indole in place of tryptophan, or hisD, which allows cells 
to utilize histinol in place of histidine (S.C. Hartman and R.C. Mulligan, 1988, Proc. 
Natl. Acad. Sci., 85:8047-51). Recently, the use of visible markers has gained 
popularity with such markers as the anthocyanins, B-glucuronidase and its substrate 
GUS, and luciferase and its substrate luciferin, which are widely used not only to 
identify transformants, but also to quantify the amount of transient or stable protein 
expression that is attributable to a specific vector system (C.A. Rhodes et al, 1995, 
Methods Mol. Biol., 55:121-131). 

[0121] Although the presence or absence of marker gene expression suggests 

that the gene of interest is also present, the presence and expression of the desired gene 
of interest may need to be confirmed. For example, if the nucleic acid sequence 
encoding the HGPRBMY4 polypeptide is inserted within a marker gene sequence, 
recombinant cells containing sequences encoding the HGPRBMY4 polypeptide can be 
identified by the absence of marker gene function. Alternatively, a marker gene can be 
placed in tandem with a sequence encoding the HGPRBMY4 polypeptide under the 
control of a single promoter. Expression of the marker gene in response to induction 
or selection usually indicates co-expression of the tandem gene. 
[0122] Alternatively, host cells, which contain the nucleic acid, sequence 
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encoding the HGPRBMY4 polypeptide and which express HGPRBMY4 polypeptide 
product may be identified by a variety of procedures known to those having skill in the 
art. These procedures include, but are not limited to, DNA-DNA or DNA-RNA 
hybridizations and protein bioassay or immunoassay techniques, including membrane, 
solution, or chip based technologies, for the detection and/ or quantification of nucleic 
acid or protein. 

[0123] The presence of polynucleotide sequences encoding the HGPRBMY4 

polypeptide can be detected by DNA-DNA or DNA-RNA hybridization, or by 
amplification using probes or portions or fragments of polynucleotides encoding the 
HGPRBMY4 polypeptide. Nucleic acid amplification based assays involve the use of 
oligonucleotides or oligomers, based on the sequences encoding the HGPRBMY4 
polypeptide, to detect transformants containing DNA or RNA encoding the 
HGPRBMY4 polypeptide. 

[0124] A wide variety of labels and conjugation techniques are known and 

employed by those skilled in the art and may be used in various nucleic acid and 
amino acid assays. Means for producing labeled hybridization or PCR probes for 
detecting sequences related to polynucleotides encoding HGPRBMY4 polypeptide 
include oligo-labeling, nick translation, end-labeling, or PCR amplification using a 
labeled nucleotide. Alternatively, the sequences encoding HGPRBMY4 polypeptide, 
or any portions or fragments thereof, may be cloned into a vector for the production of 
an mRNA probe. Such vectors are known in the art, are commercially available, and 
may be used to synthesize RNA probes in vitro by addition of an appropriate RNA 
polymerase, such as T7, T3, or SP(6) and labeled nucleotides. These procedures may 
be conducted using a variety of commercially available kits (e.g., Amersham 
Pharmacia Biotech, Promega and U.S. Biochemical Corp.). Suitable reporter 
molecules or labels which may be used include radionuclides, enzymes, fluorescent, 
chemiluminescent, or chromogenic agents, as well as substrates, cofactors, inhibitors, 
magnetic particles, and the like. 

[0125] Host cells transformed with nucleotide sequences encoding 
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HGPRBMY4 protein, or fragments thereof, may be cultured under conditions suitable 
for the expression and recovery of the protein from cell culture. The protein produced 
by a recombinant cell may be secreted or contained intraceliularly depending on the 
sequence and/ or the vector used. As will be understood by those having skill in the 
art, expression vectors containing polynucleotides which encode the HGPRBMY4 
protein may be designed to contain signal sequences which direct secretion of the 
HGPRBMY4 protein through a prokaryotic or eukaryotic cell membrane. Other 
constructions may be used to join nucleic acid sequences encoding the HGPRBMY4 
protein to nucleotide sequence encoding a polypeptide domain, which will facilitate 
purification of soluble proteins. Such purification facilitating domains include, but are 
not limited to, metal chelating peptides such as histidine-tryptophan modules that 
allow purification on immobilized metals; protein A domains that allow purification on 
immobilized immunoglobulin; and the domain utilized in the FLAGS extension/ 
affinity purification system (Immunex Corp., Seattle, WA). The inclusion of cleavable 
linker sequences such as those specific for Factor XA or enterokinase (Invitrogen, San 
Diego, CA) between the purification domain and HGPRBMY4 protein may be used to 
facilitate purification. One such expression vector provides for expression of a fusion 
protein containing HGPRBMY4 and a nucleic acid encoding 6 histidine residues 
preceding a thioredoxin or an enterokinase cleavage site. The histidine residues 
facilitate purification on IMAC (immobilized metal ion affinity chromatography) as 
described by J. Porath et al., 1992, Prot. Exp. Purif., 3:263-281, while the enterokinase 
cleavage site provides a means for purifying from the fusion protein. For a discussion 
of suitable vectors for fusion protein production, see D.J. Kroll et al., 1993; DNA Cell 
Biol, 12:441-453. 

[0126] In addition to recombinant production, fragments of HGPRBMY4 

polypeptide may be produced by direct peptide synthesis using solid-phase techniques 
(J. Merrifield, 1963, J. Am. Chem. Soc, 85:2149-2154). Protein synthesis may be 
performed using manual techniques or by automation. Automated synthesis may be 
achieved, for example, using ABI 431 A Peptide Synthesizer (PE Biosystems). 
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Various fragments of HGPRBMY4 polypeptide can be chemically synthesized 
separately and then combined using chemical methods to produce the full length 
molecule. 

[0127] Human artificial chromosomes (HACs) may be used to deliver larger 

fragments of DNA than can be contained and expressed in a plasmid vector. HACs 
are linear microchromosomes which may contain DNA sequences of 10K to 10M in 
size, and contain all of the elements that are required for stable mitotic chromosome 
segregation and maintenance (see, J.J. Harrington et al., 1997, Nature Genet., 15:345- 
355). HACs of 6 to 10M are constructed and delivered via conventional delivery 
methods (e.g., liposomes, polycationic amino polymers, or vesicles) for therapeutic 
purposes. 
Diagnostic Assays 

[0128] A variety of protocols for detecting and measuring the expression of the 

HGPRBMY4 polypeptide using either polyclonal or monoclonal antibodies specific 
for the protein are known and practiced in the art. Examples include enzyme-linked 
immunosorbent assay (ELISA), radioimmunoassay (RIA), and fluorescence activated 
cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive with two non-interfering epitopes on the HGPRBMY4 
polypeptide is preferred, but a competitive binding assay may also be employed. 
These and other assays are described in the art as represented by the publication of R. 
Hampton et al., 1990; Serological Methods, a Laboratory Manual, APS Press, St Paul, 
MN and D.E. Maddox et al., 1983; J. Exp. Med., 158:1211-1216). 
[0129] This invention also relates to the use of HGPRBMY4 polynucleotides 

as diagnostic reagents. Detection of a mutated form of the HGPRBMY4 gene 
associated with a dysfunction will provide a diagnostic tool that can add to or define a 
diagnosis of a disease or susceptibility to a disease which results from under- 
expression, over-expression, or altered expression of HGPRBMY4. Individuals 
carrying mutations in the HGPRBMY4 gene may be detected at the DNA level by a 
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variety of techniques. 

[0130] Nucleic acids for diagnosis may be obtained from a subject's cells, such 

as from blood, urine, saliva, tissue biopsy or autopsy material. The genomic DNA 
may be used directly for detection or may be amplified enzymatically by using PCR or 
other amplification techniques prior to analysis. RNA or cDNA may also be used in 
similar fashion. Deletions and insertions can be detected by a change in size of the 
amplified product in comparison to the normal genotype. Hybridizing amplified DNA 
to labeled HGPRBMY4 polynucleotide sequences can identify point mutations. 
Perfectly matched sequences can be distinguished from mismatched duplexes by 
RNase digestion or by differences in melting temperatures. DNA sequence differences 
may also be detected by alterations in electrophoretic mobility of DNA fragments in 
gels, with or without denaturing agents, or by direct DNA sequencing. See, e.g., 
Myers et al., Science (1985) 230:1242. Sequence changes at specific locations may 
also be revealed by nuclease protection assays, such as RNase and SI protection or the 
chemical cleavage method (see Cotton et al., Proc. Natl. Acad. Sci., USA (1985) 
85:43297-4401). In another embodiment, an array of oligonucleotides probes 
comprising the HGPRBMY4 nucleotide sequence or fragments thereof can be 
constructed to conduct efficient screening of e.g., genetic mutations. Array technology 
methods are well known and have general applicability and can be used to address a 
variety of questions in molecular genetics including gene expression, genetic linkage, 
and genetic variability (see for example: M. Chee et al., Science, 274:610-613, 1996). 
[0131] The diagnostic assays offer a process for diagnosing or determining a 

susceptibility to infections such as bacterial, fungal, protozoan and viral infections, 
particularly infections caused by HIV-1 or HIV-2 through detection of a mutation in 
the HGPRBMY4 gene by the methods described. The invention also provides 
diagnostic assays for determining or monitoring susceptibility to the following 
conditions, diseases, or disorders: cancers; anorexia; bulimia asthma; Parkinson's 
disease; acute heart failure; hypotension; hypertension; urinary retention; osteoporosis; 
angina pectoris; myocardial infarction; ulcers; asthma; allergies; benign prostatic 
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hypertrophy; prostate intraepithelial neoplasm; and psychotic and neurological 
disorders, including anxiety, schizophrenia, manic depression, delirium, dementia, 
severe mental retardation and dyskinesias, such as Huntington's disease or Gilles dela 
Tourett's syndrome. 

[0132] In addition, infections such as bacterial, protozoan and viral infections, 

particularly infections caused by HIV-1 or HIV-2; as well as, conditions or disorders 
such as pain; cancers; anorexia; bulimia; asthma; Parkinson's disease; acute heart 
failure; hypotension; hypertension; urinary retention; osteoporosis; angina pectoris; 
myocardial infarction; ulcers; asthma; allergies; benign prostatic hypertrophy; prostate 
intraepithelial neoplasms; and psychotic and neurological disorders, including anxiety, 
schizophrenia, manic depression, delirium, dementia, severe mental retardation and 
dyskinesias, such as Huntington's disease or Gilles dela Tourett's syndrome, can be 
diagnosed by methods comprising determining from a sample derived from a subject 
having an abnormally decreased or increased level of the HGPRBMY4 polypeptide 
(SEQ ID NO:2) or HGPRBMY4 mRNA. Decreased or increased expression can be 
measured at the RNA level using any of the methods well known in the art for the 
quantification of polynucleotides, such as, for example, PCR, RT-PCR, RNase 
protection, Northern blotting and other hybridization methods. Assay techniques that 
can be used to determine levels of a protein, such as an HGPRBMY4, in a sample 
derived from a host are well known to those of skill in the art. Such assay methods 
include radioimmunoassays, competitive-binding assays, Western Blot analysis, and 
ELISA assays. 

[0133] In another of its aspects, the present invention relates to a diagnostic kit 

for a disease or susceptibility to a disease, particularly infections such as bacterial, 
fungal, protozoan and viral infections, particularly infections caused by HIV-1 or HIV- 
2; pain; cancers; anorexia; bulimia; asthma; Parkinson's disease; acute heart failure; 
hypotension; hypertension; urinary retention; osteoporosis; angina pectoris; 
myocardial infarction; ulcers; asthma; allergies; benign prostatic hypertrophy, prostate 
intraepithelial neoplasms, and psychotic and neurological disorders, including anxiety, 
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schizophrenia, manic depression, delirium, dementia, severe medal retardation and 
dyskinesias, such as Huntington's disease or Gilles dela Tourett's syndrome, which 
comprises: 

(a) a HGPRBMY4 polynucleotide, preferably the nucleotide sequence 
of SEQ ID NO: 1, or a fragment thereof; or 

(b) a nucleotide sequence complementary to that of (a); or 

(c) a HGPRBMY4 polypeptide, preferably the polypeptide of SEQ ID 
NO: 2, or a fragment thereof; or 

(d) an antibody to a HGPRBMY4 polypeptide, preferably to the 
polypeptide of SEQ ID NO: 2, or combinations thereof. 

It will be appreciated that in any such kit, (a), (b), (c) or (d) may comprise a substantial 
component. 

[0134] The GPCR polynucleotides which may be used in the diagnostic assays 

according to the present invention include oligonucleotide sequences, complementary 
RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and 
quantify the HGPRBMY4-encoding nucleic acid expression in biopsied tissues in 
which expression (or under- or overexpression) of the HGPRBMY4 polynucleotide 
may be correlated with disease. The diagnostic assays may be used to distinguish 
between the absence, presence, and excess expression of HGPRBMY4, and to monitor 
regulation of HGPRBMY4 polynucleotide levels during therapeutic treatment or 
intervention. 

[0135] In a related aspect, hybridization with PCR probes which are capable of 

detecting polynucleotide sequences, including genomic sequences, encoding the 
HGPRBMY4 polypeptide, or closely related molecules, may be used to identify 
nucleic acid sequences which encode the HGPRBMY4 polypeptide. The specificity of 
the probe, whether it is made from a highly specific region, e.g., about 8 to 10 
contiguous nucleotides in the 5' regulatory region, or a less specific region, e.g., 
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especially in the 3' coding region, and the stringency of the hybridization or 
amplification (maximal, high, intermediate, or low) will determine whether the probe 
identifies only naturally occurring sequences encoding HGPRBMY4 polypeptide, 
alleles thereof, or related sequences. 

[0136] Probes may also be used for the detection of related sequences, and 

should preferably contain at least 50% of the nucleotides encoding the HGPRBMY4 
polypeptide. The hybridization probes of this invention may be DNA or RNA and 
may be derived from the nucleotide sequence of SEQ ID NO: 1, or from genomic 
sequence including promoter, enhancer elements, and introns of the naturally 
occurring HGPRBMY4 protein. 

[0137] Methods for producing specific hybridization probes for DNA encoding 

the HGPRBMY4 polypeptide include the cloning of a nucleic acid sequence that 
encodes the HGPRBMY4 polypeptide, or HGPRBMY4 derivatives, into vectors for 
the production of mRNA probes. Such vectors are known in the art, commercially 
available, and may be used to synthesize RNA probes in vitro by means of the addition 
of the appropriate RNA polymerases and the appropriate labeled nucleotides. 
Hybridization probes may be labeled by a variety of detector/ reporter groups, e.g., 
radionuclides such as 32 P or 35 S, or enzymatic labels, such as alkaline phosphatase 
coupled to the probe via avidin/ biotin coupling systems, and the like. 
[0138] The polynucleotide sequence encoding the HGPRBMY4 polypeptide, 

or fragments thereof, may be used for the diagnosis of disorders associated with 
expression of HGPRBMY4. Examples of such disorders or conditions are described 
above for "Therapeutics". The polynucleotide sequence encoding the HGPRBMY4 
polypeptide may be used in Southern or Northern analysis, dot blot, or other 
membrane-based technologies; in PCR technologies; or in dip stick, pin, ELISA or 
chip assays utilizing fluids or tissues from patient biopsies to detect the status of, e.g., 
levels or overexpression of HGPRBMY4, or to detect altered HGPRBMY4 
expression. Such qualitative or quantitative methods are well known in the art. 
[0139] In a particular aspect, the nucleotide sequence encoding the 
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HGPRBMY4 polypeptide may be useful in assays that detect activation or induction of 
various neoplasms or cancers, particularly those mentioned supra. The nucleotide 
sequence encoding the HGPRBMY4 polypeptide may be labeled by standard methods, 
and added to a fluid or tissue sample from a patient, under conditions suitable for the 
formation of hybridization complexes. After a suitable incubation period, the sample 
is washed and the signal is quantified and compared with a standard value. If the 
amount of signal in the biopsied or extracted sample is significantly altered from that 
of a comparable control sample, the nucleotide sequence has hybridized with 
nucleotide sequence present in the sample, and the presence of altered levels of 
nucleotide sequence encoding the HGPRBMY4 polypeptide in the sample indicates 
the presence of the associated disease. Such assays may also be used to evaluate the 
efficacy of a particular therapeutic treatment regimen in animal studies, in clinical 
trials, or in monitoring the treatment of an individual patient. 
[0140] To provide a basis for the diagnosis of disease associated with 

expression of HGPRBMY4, a normal or standard profile for expression is established. 
This may be accomplished by combining body fluids or cell extracts taken from 
normal subjects, either animal or human, with a sequence, or a fragment thereof, which 
encodes the HGPRBMY4 polypeptide, under conditions suitable for hybridization or 
amplification. Standard hybridization may be quantified by comparing the values 
obtained from normal subjects with those from an experiment where a known amount 
of a substantially purified polynucleotide is used. Standard values obtained from 
normal samples may be compared with values obtained from samples from patients 
who are symptomatic for disease. Deviation between standard and subject (patient) 
values is used to establish the presence of disease. 

[0141] Once disease is established and a treatment protocol is initiated, 

hybridization assays may be repeated on a regular basis to evaluate whether the level 
of expression in the patient begins to approximate that which is observed in a normal 
individual. The results obtained from successive assays may be used to show the 
efficacy of treatment over a period ranging from several days to months. 
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[0142] With respect to cancer, the presence of an abnormal amount of 

transcript in biopsied tissue from an individual may indicate a predisposition for the 
development of the disease, or may provide a means for detecting the disease prior to 
the appearance of actual clinical symptoms. A more definitive diagnosis of this type 
may allow health professionals to employ preventative measures or aggressive 
treatment earlier, thereby preventing the development or further progression of the 
cancer. 

[0143] Additional diagnostic uses for oligonucleotides designed from the 

nucleic acid sequence encoding the HGPRBMY4 polypeptide may involve the use of 
PCR. Such oligomers may be chemically synthesized, generated enzymatically, or 
produced from a recombinant source. Oligomers will preferably comprise two 
nucleotide sequences, one with sense orientation (5'->3') and another with antisense 
(3'— »5'), employed under optimized conditions for identification of a specific gene or 
condition. The same two oligomers, nested sets of oligomers, or even a degenerate 
pool of oligomers may be employed under less stringent conditions for detection and/ 
or quantification of closely related DNA or RNA sequences. 
[0144] Methods suitable for quantifying the expression of HGPRBMY4 

include radiolabeling or biotinylating nucleotides, co-amplification of a control nucleic 
acid, and standard curves onto which the experimental results are interpolated (P.C. 
Melby et al., 1993, J. Immunol. Methods, 159:235-244; and C. Duplaa et al., 1993, 
Anal. Biochem., 229-236). The speed of quantifying multiple samples may be 
accelerated by running the assay in an ELISA format where the oligomer of interest is 
presented in various dilutions and a spectrophotometric or colorimetric response gives 
rapid quantification. 

Therapeutic Assays 

[0145] HGPRBMY4 polypeptide shares homology with a putative G-protein 

coupled receptor (RA1C). HGPRBMY4 may play a role in prostate-, colon-, lung-, or 
cardiovascular- related disorders, and/ or in cell signaling and/ or cell cycle regulation. 
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The HGPRBMY4 protein may be further involved in neoplastic, neurological, 
cardiovascular, and prostate-, colon-, lung-, or cardiovascular- related disorders, where 
it may also be associated with cell cycle and cell signaling activities, as described 
further below. 

[0146] In one embodiment of the present invention, the HGPRBMY4 protein 

may play a role in neoplastic disorders. An antagonist of the HGPRBMY4 
polypeptide may be administered to an individual to prevent or treat a neoplastic 
disorder. Such disorders may include, but are not limited to, adenocarcinoma, 
leukemia, lymphoma, melanoma, myeloma, sarcoma, and teratocarcinoma, and 
particularly, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, 
cervix, colon, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, 
muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, 
testis, thymus, thyroid, and uterus. In a related aspect, an antibody which specifically 
binds to HGPRBMY4 may be used directly as an antagonist or indirectly as a targeting 
or delivery mechanism for bringing a pharmaceutical agent to cells or tissue which 
express the HGPRBMY4 polypeptide. 

[0147] In another embodiment of the present invention, an antagonist or 

inhibitory agent of the HGPRBMY4 polypeptide may be administered to a subject to 
prevent or treat a neurological disorder. Such disorders may include, but are not 
limited to, akathesia, Alzheimer's disease, amnesia, amyotrophic lateral sclerosis, 
bipolar disorder, catatonia, cerebral neoplasms, dementia, depression, Down's 
syndrome, tardive dyskinesia, dystonias, epilepsy, Huntington's disease, multiple 
sclerosis, Parkinson's disease, paranoid psychoses, schizophrenia, and Tourette's 
disorder. 

[0148] In another embodiment of the present invention, an antagonist or 

inhibitory agent of the HGPRBMY4 polypeptide may be administered to an individual 
to prevent or treat an immune disorder. Such disorders may include, but are not 
limited to, AIDS, Addison's disease, adult respiratory distress syndrome, allergies, 
anemia, asthma, atherosclerosis, bronchitis, cholecystitis, Crohn's disease, ulcerative 
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colitis, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, erythema 
nodosum, atrophic gastritis, glomerulonephritis, gout, Graves' disease, 
hypereosinophilia, irritable bowel syndrome, lupus erythematosus, multiple sclerosis, 
myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, 
osteoporosis, pancreatitis, polymyositis, rheumatoid arthritis, scleroderma, Sjogren's 
syndrome, and autoimmune thyroiditis; complications of cancer, hemodialysis, 
extracorporeal circulation; viral, bacterial, fungal, parasitic, protozoal, and helminthic 
infections and trauma. 

[0149] In a preferred embodiment of the present invention, an antagonist or 

inhibitory agent of the HGPRBMY4 polypeptide may be administered to an individual 
to prevent or treat a prostate-, colon-, lung-, and/or cardiovascular- related disorder, 
particularly since HGPRBMY4 is highly expressed in prostate, colon, and lung, while 
moderately expressed in the heart. Such conditions or disorders may include, but are 
not limited to, prostatitis, benign prostatic hyperplasia, prostate intraepithelial 
neoplasms, urogenital cancers, Crohn's disease, Hirschsprung's disease, inflammatory 
bowel disease, Chagas' disease, bronchopulmonary disease, post-inflammatory 
pseudotumor, Pancoast's syndrome, and cardiovascular diseases. 
[0150] In preferred embodiments, the HGPRBMY4 polynucleotides and 

polypeptides, including agonists, antagonists, and fragments thereof, are useful for 
modulating intracellular Ca 2+ levels, modulating Ca 2+ sensitive signaling pathways, 
and modulating NFAT element associated signaling pathways. 
[0151] In another embodiment of the present invention, an expression vector 

containing the complement of the polynucleotide encoding HGPRBMY4 polypeptide 
may be administered to an individual to treat or prevent a neoplastic disorder, 
including, but not limited to, the types of cancers and tumors described above. 
[0152] In a further embodiment of the present invention, an expression vector 

containing the complement of the polynucleotide encoding HGPRBMY4 polypeptide 
may be administered to an individual to treat or prevent a neurological disorder, 
including, but not limited to, the types of disorders described above. 
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[0153] In yet another embodiment of the present invention, an expression 

vector containing the complement of the polynucleotide encoding HGPRBMY4 
polypeptide may be administered to an individual to treat or prevent a prostate- related 
disorder, including, but not limited to, prostatitis, benign prostatic hyperplasia, prostate 
intraepithelial neoplasms, and urogenital cancers. Additionally, the present invention 
may be used to treat or prevent a colon- or lung- related disease, disorder, or condition, 
including, but not limited to, Crohn's disease, Hirschsprung's disease, ulceritive 
colitis, prediverticular disease of the colon, colonic diverticulitis, colonic carcinoma, 
Hand-Schuller-Christian syndrome, eosinophilic granuloma, desquamative interstitial 
pneumonia, inflammatory bowel disease, Chagas' disease, bronchopulmonary 
dysplasia, post-inflammatory pseudotumor, Pancoast's syndrome, and other lung 
diseases, including carcinoma. 

[0154] In another embodiment, the proteins, antagonists, antibodies, agonists, 

complementary sequences, or vectors of the present invention can be administered in 
combination with other appropriate therapeutic agents. Selection of the appropriate 
agents for use in combination therapy may be made by one of ordinary skill in the art, 
according to conventional pharmaceutical principles. The combination of therapeutic 
agents may act synergistically to effect the treatment or prevention of the various 
disorders described above. Using this approach, one may be able to achieve 
therapeutic efficacy with lower dosages of each agent, thus reducing the potential for 
adverse side effects. 

[0155] Antagonists or inhibitors of the HGPRBMY4 polypeptide of the present 

invention may be produced using methods which are generally known in the art. For 
example, HGPRBMY4 transfected CHO-NFAT/CRE cell lines of the present 
invention are useful for the identification of agonists and antagonists of the 
HGPRBMY4 polypeptide. Representative uses of these cell lines would be their 
inclusion in a method of identifying HGPRBMY4 agonists and antagonists. 
Preferably, the cell lines are useful in a method for identifying a compound that 
modulates the biological activity of the HGPRBMY4 polypeptide, comprising the 
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steps of (a) combining a candidate modulator compound with a host cell expressing the 
HGPRBMY4 polypeptide having the sequence as set forth in SEQ ID NO:2; and (b) 
measuring an effect of the candidate modulator compound on the activity of the 
expressed HGPRBMY4 polypeptide. Representative vectors expressing the 
HGPRBMY4 polypeptide are referenced herein (e.g., pcDNA3.1 hygro ™) or 
otherwise known in the art. 

[0156] The cell lines are also useful in a method of screening for a compounds 

that is capable of modulating the biological activity of HGPRBMY4 polypeptide, 
comprising the steps of: (a) determining the biological activity of the HGPRBMY4 
polypeptide in the absence of a modulator compound; (b) contacting a host cell 
expression the HGPRBMY4 polypeptide with the modulator compound; and (c) 
determining the biological activity of the HGPRBMY4 polypeptide in the presence of 
the modulator compound; wherein a difference between the activity of the 
HGPRBMY4 polypeptide in the presence of the modulator compound and in the 
absence of the modulator compound indicates a modulating effect of the compound. 
Additional uses for these cell lines are described herein or otherwise known in the art. 
In particular, purified HGPRBMY4 protein, or fragments thereof, can be used to 
produce antibodies, or to screen libraries of pharmaceutical agents, to identify those 
which specifically bind HGPRBMY4. 

[0157] Antibodies specific for HGPRBMY4 polypeptide, or immunogenic 

peptide fragments thereof, can be generated using methods that have long been known 
and conventionally practiced in the art. Such antibodies may include, but are not 
limited to, polyclonal, monoclonal, chimeric, single chain, Fab fragments, and 
fragments produced by an Fab expression library. Neutralizing antibodies, (i.e., those 
which inhibit dimer formation) are especially preferred for therapeutic use. 
[0158] The present invention also encompasses the polypeptide sequences that 

intervene between each of the predicted HGPRBMY4 transmembrane domains. Since 
these regions are solvent accessible either extracellularly or intracellularly, they are 
particularly useful for designing antibodies specific to each region. Such antibodies 
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may be useful as antagonists or agonists of the HGPRBMY4 full-length polypeptide 
and may modulate its activity. 

[0159] The following serve as non-limiting examples of peptides or fragments 

that may be used to generate antibodies: 

MMVDPNGNES S AT YFILIGLPGLEE AQ (SEQ ID NO: 17) 

RTEHSLHEPMY (SEQ ID NO : 1 8) 

NSTTIQFD ACLLQM (SEQ ID NO : 1 9) 

HPLRHAT VLTLPRVTK (SEQ ID NO :20) 

KQLPFCRSNILSHS YCLHQDVMKLACDDIR (SEQ ID NO :2 1 ) 
KTVLGLTREAQAKA (SEQ ID NO:22) 

HRFSKRRDSP (SEQ ID NO:23) 

KTKEIRQRILRLFHVATHASEP (SEQ ID NO:24) 

[01 60] In preferred embodiments, the following N-terminal HGPRBMY4 N- 

terminal fragment deletion polypeptides are encompassed by the present invention: 
M1-Q27, M2-Q27, V3-Q27, D4-Q27, P5-Q27, N6-Q27, G7-Q27, N8-Q27, E9-Q27, 
S10-Q27, S11-Q27, A12-Q27, T13-Q27, Y14-Q27, F15-Q27, 116-Q27, L17-Q27, 118- 
Q27, G19-Q27, L20-Q27, and/or P21-Q27 of SEQ ID NO: 17. Polynucleotide 
sequences encoding these polypeptides are also provided. The present invention also 
encompasses the use of these N-terminal HGPRBMY4 N-terminal fragment deletion 
polypeptides as immunogenic and/or antigenic epitopes as described elsewhere herein. 
[0161] In preferred embodiments, the following C-terminal HGPRBMY4 N- 

terminal fragment deletion polypeptides are encompassed by the present invention: 
M1-Q27, M1-A26, M1-E25, M1-E24, M1-L23, M1-G22, M1-P21, M1-L20, M1-G19, 
M1-I18, M1-L17, M1-I16, M1-F15, M1-Y14, M1-T13, M1-A12, Ml-Sll, M1-S10, 
M1-E9, M1-N8, and/or M1-G7 of SEQ ID NO:17. Polynucleotide sequences 
encoding these polypeptides are also provided. The present invention also 
encompasses the use of these C-terminal HGPRBMY4 N-terminal fragment deletion 
polypeptides as immunogenic and/or antigenic epitopes as described elsewhere herein. 
[0162] In preferred embodiments, the following N-terminal HGPRBMY4 
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TM1-2 intertransmembrane domain deletion polypeptides are encompassed by the 
present invention: Rl-Yll, T2-Y11, E3-Y11, H4-Y11, and/or S5-Y11 of SEQ ID 
NO: 18. Polynucleotide sequences encoding these polypeptides are also provided. The 
present invention also encompasses the use of these N-terminal HGPRBMY4 TM1-2 
intertransmembrane domain deletion polypeptides as immunogenic and/or antigenic 
epitopes as described elsewhere herein. 

[0163] In preferred embodiments, the following C-terminal HGPRBMY4 

TM1-2 intertransmembrane domain deletion polypeptides are encompassed by the 
present invention: Rl-Yl 1, R1-M10, R1-P9, R1-E8, and/or R1-H7 of SEQ ID NO: 18. 
Polynucleotide sequences encoding these polypeptides are also provided. The present 
invention also encompasses the use of these C-terminal HGPRBMY4 TM1-2 
intertransmembrane domain deletion polypeptides as immunogenic and/or antigenic 
epitopes as described elsewhere herein. 

[01 64] In preferred embodiments, the following N-terminal HGPRBMY4 

TM2-3 intertransmembrane domain deletion polypeptides are encompassed by the 
present invention: N1-M14, S2-M14, T3-M14, T4-M14, 15-M14, Q6-M14, F7-M14, 
and/or D8-M14 of SEQ ID NO: 19. Polynucleotide sequences encoding these 
polypeptides are also provided. The present invention also encompasses the use of 
these N-terminal HGPRBMY4 TM2-3 intertransmembrane domain deletion 
polypeptides as immunogenic and/or antigenic epitopes as described elsewhere herein. 
[01 65] In preferred embodiments, the following C-terminal HGPRBMY4 

TM2-3 intertransmembrane domain deletion polypeptides are encompassed by the 
present invention: N1-M14, N1-Q13, N1-L12, Nl-Lll, N1-C10, N1-A9, N1-D8, 
and/or N1-F7 of SEQ ID NO: 19. Polynucleotide sequences encoding these 
polypeptides are also provided. The present invention also encompasses the use of 
these C-terminal HGPRBMY4 TM2-3 intertransmembrane domain deletion 
polypeptides as immunogenic and/or antigenic epitopes as described elsewhere herein. 
[0166] In preferred embodiments, the following N-terminal HGPRBMY4 

TM3-4 intertransmembrane domain deletion polypeptides are encompassed by the 
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present invention: H1-K16, P2-K16, L3-K16, R4-K16, H5-K16, A6-K16, T7-K16, 
V8-K16, L9-K16, and/or T10-K16 of SEQ ED NO:20. Polynucleotide sequences 
encoding these polypeptides are also provided. The present invention also 
encompasses the use of these N-terminal HGPRBMY4 TM3-4 intertransmembrane 
domain deletion polypeptides as immunogenic and/or antigenic epitopes as described 
elsewhere herein. 

[01 67] In preferred embodiments, the following C-terminal HGPRBMY4 

TM3-4 intertransmembrane domain deletion polypeptides are encompassed by the 
present invention: H1-K16, H1-T15, H1-V14, H1-R13, H1-P12, Hl-Lll, H1-T10, Hl- 
L9, H1-V8, and/or H1-T7 of SEQ ID NO:20. Polynucleotide sequences encoding 
these polypeptides are also provided. The present invention also encompasses the use 
of these C-terminal HGPRBMY4 TM3-4 intertransmembrane domain deletion 
polypeptides as immunogenic and/or antigenic epitopes as described elsewhere herein. 
[0168] In preferred embodiments, the following N-terminal HGPRBMY4 

TM4-5 intertransmembrane domain deletion polypeptides are encompassed by the 
present invention: K1-R30, Q2-R30, L3-R30, P4-R30, F5-R30, C6-R30, R7-R30, S8- 
R30, N9-R30, 110-R30, L11-R30, S12-R30, H13-R30, S14-R30, Y15-R30, C16-R30, 
L17-R30, H18-R30, Q19-R30, D20-R30, V21-R30, M22-R30, K23-R30, and/or L24- 
R30 of SEQ ID NO:21. Polynucleotide sequences encoding these polypeptides are 
also provided. The present invention also encompasses the use of these N-terminal 
HGPRBMY4 TM4-5 intertransmembrane domain deletion polypeptides as 
immunogenic and/or antigenic epitopes as described elsewhere herein. 
[01 69] In preferred embodiments, the following C-terminal HGPRBMY4 

TM4-5 intertransmembrane domain deletion polypeptides are encompassed by the 
present invention: K1-R30, K1-I29, K1-D28, K1-D27, K1-C26, K1-A25, K1-L24, Kl- 
K23, K1-M22, K1-V21, K1-D20, K1-Q19, K1-H18, K1-L17, K1-C16, K1-Y15, Kl- 
S14, K1-H13, K1-S12, Kl-Lll, K1-I10, K1-N9, K1-S8, and/or K1-R7 of SEQ ID 
NO:21. Polynucleotide sequences encoding these polypeptides are also provided. The 
present invention also encompasses the use of these C-terminal HGPRBMY4 TM4-5 
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intertransmembrane domain deletion polypeptides as immunogenic and/or antigenic 
epitopes as described elsewhere herein. 

[0170] In preferred embodiments, the following N-terminal HGPRBMY4 

TM5-6 intertransmembrane domain deletion polypeptides are encompassed by the 
present invention: K1-A14, T2-A14, V3-A14, L4-A14, G5-A14, L6-A14, T7-A14, 
and/or R8-A14 of SEQ ID NO:22. Polynucleotide sequences encoding these 
polypeptides are also provided. The present invention also encompasses the use of 
these N-terminal HGPRBMY4 TM5-6 intertransmembrane domain deletion 
polypeptides as immunogenic and/or antigenic epitopes as described elsewhere herein. 
[0171] In preferred embodiments, the following C-terminal HGPRBMY4 

TM5-6 intertransmembrane domain deletion polypeptides are encompassed by the 
present invention: K1-A14, K1-K13, K1-A12, Kl-Qll, K1-A10, K1-E9, K1-R8, 
and/or K1-T7 of SEQ ID NO:22. Polynucleotide sequences encoding these 
polypeptides are also provided. The present invention also encompasses the use of 
these C-terminal HGPRBMY4 TM5-6 intertransmembrane domain deletion 
polypeptides as immunogenic and/or antigenic epitopes as described elsewhere herein. 
[01 72] In preferred embodiments, the following N-terminal HGPRBMY4 

TM6-7 intertransmembrane domain deletion polypeptides are encompassed by the 
present invention: H1-P10, R2-P10, F3-P10, and/or S4-P10 of SEQ ID NO:23. 
Polynucleotide sequences encoding these polypeptides are also provided. The present 
invention also encompasses the use of these N-terminal HGPRBMY4 TM6-7 
intertransmembrane domain deletion polypeptides as immunogenic and/or antigenic 
epitopes as described elsewhere herein. 

[01 73] In preferred embodiments, the following C-terminal HGPRBMY4 

TM6-7 intertransmembrane domain deletion polypeptides are encompassed by the 
present invention: H1-P10, H1-S9, H1-D8, and/or H1-R7 of SEQ ID NO:23. 
Polynucleotide sequences encoding these polypeptides are also provided. The present 
invention also encompasses the use of these C-terminal HGPRBMY4 TM6-7 
intertransmembrane domain deletion polypeptides as immunogenic and/or antigenic 
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epitopes as described elsewhere herein. 

[01 74] In preferred embodiments, the following N-terminal HGPRBMY4 C- 

terminal fragment deletion polypeptides are encompassed by the present invention: 
K1-P22, T2-P22, K3-P22, E4-P22, 15-P22, R6-P22, Q7-P22, R8-P22, I9-P22, L10- 
P22, Rl 1-P22, L12-P22, F13-P22, H14-P22, V15-P22, and/or A16-P22 of SEQ ID 
NO:24. Polynucleotide sequences encoding these polypeptides are also provided. The 
present invention also encompasses the use of these N-terminal HGPRBMY4 C- 
terminal fragment deletion polypeptides as immunogenic and/or antigenic epitopes as 
described elsewhere herein. 

[01 75] In preferred embodiments, the following C-terminal HGPRBMY4 C- 

terminal fragment deletion polypeptides are encompassed by the present invention: 
K1-P22, K1-E21, K1-S20, K1-A19, K1-H18, K1-T17, K1-A16, K1-V15, K1-H14, 
K1-F13, K1-L12, Kl-Rll, K1-L10, K1-I9, K1-R8, and/or K1-Q7 of SEQ ID NO:24. 
Polynucleotide sequences encoding these polypeptides are also provided. The present 
invention also encompasses the use of these C-terminal HGPRBMY4 C-terminal 
fragment deletion polypeptides as immunogenic and/or antigenic epitopes as described 
elsewhere herein. 

[01 76] The HGPRBMY4 polypeptides of the present invention were 

determined to comprise several phosphorylation sites based upon the Motif algorithm 
(Genetics Computer Group, Inc.). The phosphorylation of such sites may regulate 
some biological activity of the HGPRBMY4 polypeptide. For example, 
phosphorylation at specific sites may be involved in regulating the proteins ability to 
associate or bind to other molecules (e.g., proteins, ligands, substrates, DNA, etc.). In 
the present case, phosphorylation may modulate the ability of the HGPRBMY4 
polypeptide to associate with other polypeptides, particularly cognate ligand for 
HGPRBMY4, or its ability to modulate certain cellular signal pathways. 
[01 77] The HGPRBMY4 polypeptide was predicted to comprise one PKC 

phosphorylation site using the Motif algorithm (Genetics Computer Group, Inc.). In 
vivo, protein kinase C exhibits a preference for the phosphorylation of serine or 
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threonine residues. The PKC phosphorylation sites have the following consensus 
pattern: [ST]-x-[RK], where S or T represents the site of phosphorylation and 'x' an 
intervening amino acid residue. Additional information regarding PKC 
phosphorylation sites can be found in Woodget J.R., Gould K.L., Hunter T., Eur. J. 
Biochem. 161:177-184(1986), and Kishimoto A., NishiyamaK., Nakanishi H., 
Uratsuji Y., Nomura H., Takeyama Y., Nishizuka Y., J. Biol. Chem . 260:12492- 
12499(1985); which are hereby incorporated by reference herein. 
[0178] In preferred embodiments, the following PKC phosphorylation site 

polypeptide is encompassed by the present invention: M VHRF SKRRD SPL (SEQ ID 
NO:33). Polynucleotides encoding this polypeptide is also provided. The present 
invention also encompasses the use of the HGPRBMY4 PKC phosphorylation site 
polypeptide as an immunogenic and/or antigenic epitope as described elsewhere 
herein. 

[0179] The HGPRBMY4 polypeptide was predicted to comprise three casein 

kinase II phosphorylation sites using the Motif algorithm (Genetics Computer Group, 
Inc.). Casein kinase II (CK-2) is a protein serine/threonine kinase whose activity is 
independent of cyclic nucleotides and calcium. CK-2 phosphorylates many different 
proteins. The substrate specificity [1] of this enzyme can be summarized as follows: 
(1) Under comparable conditions Ser is favored over Thr.; (2) An acidic residue (either 
Asp or Glu) must be present three residues from the C-terminal of the phosphate 
acceptor site; (3) Additional acidic residues in positions +1, +2, +4, and +5 increase 
the phosphorylation rate. Most physiological substrates have at least one acidic 
residue in these positions; (4) Asp is preferred to Glu as the provider of acidic 
determinants; and (5) A basic residue at the N-terminal of the acceptor site 
decreases the phosphorylation rate, while an acidic one will increase it. 
[0180] A consensus pattern for casein kinase II phosphorylations site is as 

follows: [ST]-x(2)-[DE], wherein 'x' represents any amino acid, and S or T is the 
phosphorylation site. 

[0181] Additional information specific to casein kinase II phosphorylation site 
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domains may be found in reference to the following publication: Pinna L.A., Biochim. 
Biophys. Acta 1054:267-284(1990); which is hereby incorporated herein in its 
entirety. 

[0182] In preferred embodiments, the following casein kinase II 

phosphorylation site polypeptide is encompassed by the present invention: 
VRTEHSLHEPMYIF (SEQ ID NO:34), FLCMLSGEDILIST (SEQ ID NO:35), and/or 
AIHSLSGMESTVLL (SEQ ID NO:36). Polynucleotides encoding these polypeptides 
are also provided. The present invention also encompasses the use of this casein 
kinase II phosphorylation site polypeptide as an immunogenic and/or antigenic epitope 
as described elsewhere herein. 

[01 83] The HGPRBMY4 polypeptide was predicted to comprise two cAMP- 

and cGMP-dependent protein kinase phosphorylation site using the Motif algorithm 
(Genetics Computer Group, Inc.). There has been a number of studies relative to the 
specificity of cAMP- and cGMP-dependent protein kinases. Both types of kinases 
appear to share a preference for the phosphorylation of serine or threonine residues 
found close to at least two consecutive N-terminal basic residues. 
[0184] A consensus pattern for cAMP- and cGMP-dependent protein kinase 

phosphorylation sites is as follows: [RK](2)-x-[ST], wherein "x" represents any amino 
acid, and S or T is the phosphorylation site. 

[0185] Additional information specific to cAMP- and cGMP-dependent protein 

kinase phosphorylation sites may be found in reference to the following publication: 
Fremisco J.R., Glass D.B., Krebs E.G, J. Biol. Chem . 255:4240-4245(1980); Glass 
D.B., Smith S.B., J. Biol. Chem . 258:14797-14803(1983); and Glass D.B., El- 
Maghrabi M.R., Pilkis S.J., J. Biol. Chem . 261:2987-2993(1986); which is hereby 
incorporated herein in its entirety. 

[0186] In preferred embodiments, the following cAMP- and cGMP-dependent 

protein kinase phosphorylation site polypeptide is encompassed by the present 
invention: HRFSKRRDSPLPVI (SEQ ID NO:37). Polynucleotides encoding this 
polypeptide are also provided. The present invention also encompasses the use of this 
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cAMP- and cGMP-dependent protein kinase phosphorylation site polypeptide as an 
immunogenic and/or antigenic epitope as described elsewhere herein. 
[0187] The HGPRBMY4 polypeptide has been shown to comprise three 

glycosylation sites according to the Motif algorithm (Genetics Computer Group, Inc.). 
As discussed more specifically herein, protein glycosylation is thought to serve a 
variety of functions including: augmentation of protein folding, inhibition of protein 
aggregation, regulation of intracellular trafficking to organelles, increasing resistance 
to proteolysis, modulation of protein antigenicity, and mediation of intercellular 
adhesion. 

[0188] Asparagine glycosylation sites have the following concensus pattern, N- 

{P}-[ST]-{P}, wherein N represents the glycosylation site. However, it is well known 
that that potential N-glycosylation sites are specific to the consensus sequence Asn- 
Xaa-Ser/Thr. However, the presence of the consensus tripeptide is not sufficient to 
conclude that an asparagine residue is glycosylated, due to the fact that the folding of 
the protein plays an important role in the regulation of N-glycosylation. It has been 
shown that the presence of proline between Asn and Ser/Thr will inhibit N- 
glycosylation; this has been confirmed by a recent statistical analysis of glycosylation 
sites, which also shows that about 50% of the sites that have a proline C-terminal to 
Ser/Thr are not glycosylated. Additional information relating to asparagine 
glycosylation may be found in reference to the following publications, which are 
hereby incorporated by reference herein: Marshall R.D., Annu. Rev. Biochem . 41 :673- 
702(1972); Pless D.D., Lennarz W.J., Proc. Natl. Acad. Sci. U.S.A . 74:134-138(1977); 
Bause E., Biochem. J . 209:331-336(1983); Gavel Y., von Heijne G., Protein Eng. 
3:433-442(1990); and Miletich J.P., Broze G.J. Jr., J. Biol. Chem . 265:11397- 
11404(1990). 

[0189] In preferred embodiments, the following VDPNGNES S ATYFI (SEQ 

ID NO:38), IAVLGNLTIIYIVR (SEQ ID NO:39), and/or AIFWFNSTTIQFDA (SEQ 
ID NO:40). Polynucleotides encoding these polypeptides are also provided. The 
present invention also encompasses the use of these HGPRBMY4 asparagine 

EXPRESS MAIL LABEL NO.: EL 912004928 US 

646785 vl 



-56- 



glycosylation site polypeptide as immunogenic and/or antigenic epitopes as described 
elsewhere herein. 

[01 90] The HGPRBMY4 polypeptide was predicted to comprise four N- 

myristoylation sites using the Motif algorithm (Genetics Computer Group, Inc.). An 
appreciable number of eukaryotic proteins are acylated by the covalent addition of 
myristate (a C14-saturated fatty acid) to their N-terminal residue via an amide linkage. 
The sequence specificity of the enzyme responsible for this modification, myristoyl 
CoA:protein N-myristoyl transferase (NMT), has been derived from the sequence of 
known N-myristoylated proteins and from studies using synthetic peptides. The 
specificity seems to be the following: i.) The N-terminal residue must be glycine; ii.) 
In position 2, uncharged residues are allowed; iii.) Charged residues, proline and 
large hydrophobic residues are not allowed; iv.) In positions 3 and 4, most, if not all, 
residues are allowed; v.) In position 5, small uncharged residues are allowed (Ala, 
Ser, Thr, Cys, Asn and Gly). Serine is favored; and vi.) In position 6, proline is not 
allowed. 

[0191] A consensus pattern for N-myristoylation is as follows: G- 

{EDRKHPFYW}-x(2)-[STAGCN]-{P}, wherein 'x' represents any amino acid, and G 
is the N-myristoylation site. 

[0192] Additional information specific to N-myristoylation sites may be found 

in reference to the following publication: Towler D.A., Gordon J.I., Adams S.P., 
Glaser L., Annu. Rev. Biochem . 57:69-99(1988); and Grand R.J.A., Biochem. J. 
258:625-638(1989); which is hereby incorporated herein in its entirety. 
[0193] In preferred embodiments, the following N-myristoylation site 

polypeptides are encompassed by the present invention: MVDPNGNESSATYFIL 
(SEQ ID NO:41), LIGLPGLEEAQFWLAF (SEQ ID NO:42), 
IHSLSGMESTVLLAMA (SEQ ID NO:43), and/or QAKAFGTCVSHVCAVF (SEQ 
ID NO:44). Polynucleotides encoding these polypeptides are also provided. The 
present invention also encompasses the use of these N-myristoylation site polypeptides 
as immunogenic and/or antigenic epitopes as described elsewhere herein. 
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[0194] Moreover, in confirmation of HGPRBMY4 representing a novel GPCR, 

the HGPRBMY4 polypeptide was predicted to comprise a G-protein coupled receptor 
motif using the Motif algorithm (Genetics Computer Group, Inc.). G-protein coupled 
receptors (also called R7G) are an extensive group of hormones, neurotransmitters, 
odorants and light receptors which transduce extracellular signals by interaction with 
guanine nucleotide-binding (G) proteins. Some examples of receptors that belong to 
this family are provided as follows: 5-hydroxytryptamine (serotonin) 1 A to IF, 2A to 
2C, 4, 5 A, 5B, 6 and 7, Acetylcholine, muscarinic-type, Ml to M5, Adenosine Al, 
A2A, A2B and A3, Adrenergic alpha-lA to -1C; alpha-2A to -2D; beta-1 to -3, 
Angiotensin II types I and II, Bombesin subtypes 3 and 4, Bradykinin Bl and B2, c3a 
and C5a anaphylatoxin, Cannabinoid CB1 and CB2, Chemokines C-C CC-CKR-1 to 
CC-CKR-8, Chemokines C-X-C CXC-CKR-1 to CXC-CKR-4, Cholecystokinin-A and 
cholecystokinin-B/gastrin, Dopamine Dl to D5, Endothelin ET-a and ET-b, fMet-Leu- 
Phe (fMLP) (N-formyl peptide), Follicle stimulating hormone (FSH-R), Galanin, 
Gastrin-releasing peptide (GRP-R), Gonadotropin-releasing hormone (GNRH-R), 
Histamine HI and H2 (gastric receptor I), Lutropin-choriogonadotropic hormone 
(LSH-R), Melanocortin MC1R to MC5R, Melatonin, Neuromedin B (NMB-R), 
Neuromedin K (NK-3R), Neuropeptide Y types 1 to 6, Neurotensin (NT-R), 
Octopamine (tyramine) from insects, Odorants, Opioids delta-, kappa- and mu-types, 
Oxytocin (OT-R), Platelet activating factor (PAF-R), Prostacyclin, Prostaglandin D2, 
Prostaglandin E2, EP1 to EP4 subtypes, Prostaglandin F2, Purinoreceptors (ATP), 
Somatostatin types 1 to 5, Substance-K (NK-2R), Substance-P (NK-1R), Thrombin, 
Thromboxane A2, Thyrotropin (TSH-R), Thyrotropin releasing factor (TRH-R), 
Vasopressin Via, V lb and V2, Visual pigments (opsins and rhodopsin), Proto- 
oncogene mas, Caenorhabditis elegans putative receptors C06G4.5, C38C10.1, 
C43C3.2,T27D1.3 and ZC84.4, Three putative receptors encoded in the genome of 
cytomegalovirus: US27, US28, and UL33., ECRF3, a putative receptor encoded in the 
genome of herpes virus saimiri. 

[0195] The structure of all GPCRs are thought to be identical. They have 
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seven hydrophobic regions, each of which most probably spans the membrane. The N- 
terminus is located on the extracellular side of the membrane and is often glycosylated, 
while the C-terminus is cytoplasmic and generally phosphorylated. Three extracellular 
loops alternate with three intracellular loops to link the seven transmembrane regions. 
Most, but not all of these receptors, lack a signal peptide. The most conserved parts of 
these proteins are the transmembrane regions and the first two cytoplasmic loops. A 
conserved acidic- Arg-aromatic triplet is present in the N-terminal extremity of the 
second cytoplasmic loop and could be implicated in the interaction with G proteins. 
[0196] The putative concensus sequence for GPCRs comprises the conserved 

triplet and also spans the major part of the third transmembrane helix, and is as 
follows: [GSTALIVMFYWC]-[GSTANCPDE]- {EDPKRH} -x(2)-[LIVMNQGA]- 
x(2)-[LIVMFT]-[GSTANC]-[LIVMFYWSTAC]-[DENH]-R-[FYWCSH]-x(2)- 
[LIVM], where "X" represents any amino acid. 

[0197] Additional information relating to G-protein coupled receptors may be 

found in reference to the following publications: Strosberg A.D., Eur. J. Biochem . 
196:1-10(1991); Kerlavage A.R., Curr. Opin. Struct. Biol . 1:394-401(1991); Probst 
W.C., Snyder L.A., Schuster D.I., Brosius J., Sealfon S.C., DNA Cell Biol . 11:1- 
20(1992); Savarese T.M., Fraser CM., Biochem. J . 283:1-9(1992); Branchek T., Curr. 
Biol . 3:315-317(1993); Stiles G.L., J. Biol. Chem . 267:6451-6454(1992); Friell T., 
Kobilka B.K., Lefkowitz R.J., Caron M.G., Trends Neurosci . 1 1 :321-324(1988); 
Stevens C.F., Curr. Biol. 1:20-22(1991); Sakurai T., Yanagisawa M., Masaki T., 
Trends Pharmacol. Sci. 13:103-107(1992); Salesse R., Remy J.J., Levin J.M., Jallal B., 
Gamier J., Biochimie 73:109-120(1991); Lancet D., Ben-Arie N., Curr. Biol . 3:668- 
674(1993); Uhl G.R., Childers S., Pasternak G., Trends Neurosci . 17:89-93(1994); 
Barnard E.A., Burnstock G., Webb T.E., Trends Pharmacol. Sci . 15:67-70(1994); 
Applebury MX., Hargrave P.A., Vision Res . 26:1881-1895(1986); Attwood T.K., 
Eliopoulos E.E., Findlay J.B.C., Gene 98:153-159(1991); 
http://www.gcrdb.uthscsa.edu/: and http://swift.embl-heidelberg.de/7tm/ . 
[0198] In preferred embodiments, the following G-protein coupled receptors 
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signature polypeptide is encompassed by the present invention: 
HSLSGMESTVLLAMAFDRYVAICHPLR (SEQ ID NO:45). Polynucleotides 
encoding this polypeptide is also provided. The present invention also encompasses 
the use of the HGPRBMY4 G-protein coupled receptors signature polypeptide as 
immunogenic and/or antigenic epitopes as described elsewhere herein. 
[0199] For the production of antibodies, various hosts including goats, rabbits, 

sheep, rats, mice, humans, and others, can be immunized by injection with the 
HGPRBMY4 polypeptide, or any fragment or oligopeptide thereof, which has 
immunogenic properties. Depending on the host species, various adjuvants may be 
used to increase the immunological response. Non-limiting examples of suitable 
adjuvants include Freund's (incomplete), mineral gels such as aluminum hydroxide or 
silica, and surface active substances such as lysolecithin, pluronic polyols, polyanions, 
peptides, oil emulsions, KLH, and dinitrophenol. Adjuvants typically used in humans 
include BCG (bacilli Calmette Guerin) and Corynebacterium parvumn. 
[0200] Preferably, the peptides, fragments, or oligopeptides used to induce 

antibodies to HGPRBMY4 polypeptide (i.e., immunogens) have an amino acid 
sequence having at least five amino acids, and more preferably, at least 7-10 amino 
acids. It is also preferable that the immunogens are identical to a portion of the amino 
acid sequence of the natural protein; they may also contain the entire amino acid 
sequence of a small, naturally occurring molecule. The peptides, fragments or 
oligopeptides may comprise a single epitope or antigenic determinant or multiple 
epitopes. Short stretches of HGPRBMY4 amino acids may be fused with those of 
another protein, such as KLH, and antibodies are produced against the chimeric 
molecule. 

[0201] Monoclonal antibodies to the HGPRBMY4 polypeptide, or 

immunogenic fragments thereof, may be prepared using any technique which provides 
for the production of antibody molecules by continuous cell lines in culture. These 
include, but are not limited to, the hybridoma technique, the human B-cell hybridoma 
technique, and the EBV-hybridoma technique (G. Kohler et al., 1975, Nature, 

EXPRESS MAIL LABEL NO.: EL 912004928 US 

646785 vl 



-60- 



256:495-497; D. Kozbor et al., 1985, J. Immunol. Methods, 81 :31-42; R.J. Cote et al., 
1983, Proc. Natl. Acad. Sci. USA, 80:2026-2030; and S.P. Cole et al., 1984, Mol. Cell 
Biol. , 62: 109-120). The production of monoclonal antibodies is well known and 
routinely used in the art. 

[0202] In addition, techniques developed for the production of "chimeric 

antibodies," the splicing of mouse antibody genes to human antibody genes to obtain a 
molecule with appropriate antigen specificity and biological activity can be used (S.L. 
Morrison et al., 1984, Proc. Natl. Acad. Sci. USA, 81:6851-6855; M.S. Neuberger et 
al., 1984, Nature, 312:604-608; and S. Takeda et al., 1985, Nature, 314:452-454). 
Alternatively, techniques described for the production of single chain antibodies may 
be adapted, using methods known in the art, to produce the HGPRBMY4 polypeptide- 
specific single chain antibodies. Antibodies with related specificity, but of distinct 
idiotypic composition, may be generated by chain shuffling from random 
combinatorial immunoglobulin libraries (D.R. Burton, 1991, Proc. Natl. Acad. Sci. 
USA, 88:11 120-3). Antibodies may also be produced by inducing in vivo production 
in the lymphocyte population or by screening recombinant immunoglobulin libraries 
or panels of highly specific binding reagents as disclosed in the literature (R. Orlandi 
et al., 1989, Proc. Natl. Acad. Sci. USA, 86:3833-3837 and G. Winter et al., 1991, 
Nature, 349:293-299). 

[0203] Antibody fragments which contain specific binding sites for 

HGPRBMY4 polypeptide may also be generated. For example, such fragments 
include, but are not limited to, F(ab')2 fragments which can be produced by pepsin 
digestion of the antibody molecule and Fab fragments which can be generated by 
reducing the disulfide bridges of the F(ab') 2 fragments. Alternatively, Fab expression 
libraries may be constructed to allow rapid and easy identification of monoclonal Fab 
fragments with the desired specificity (W.D. Huse et al, 1989, Science, 254.1275- 
1281). 

[0204] Various immunoassays can be used for screening to identify antibodies 

having the desired specificity. Numerous protocols for competitive binding or 
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immunoradiometric assays using either polyclonal or monoclonal antibodies with 
established specificities are well known in the art. Such immunoassays typically 
involve measuring the formation of complexes between the HGPRBMY4 polypeptide 
and its specific antibody. A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive with two non-interfering HGPRBMY4 polypeptide 
epitopes is preferred, but a competitive binding assay may also be employed (Maddox, 
supra). 

[0205] Another aspect of the invention relates to a method for inducing an 

immunological response in a mammal which comprises inoculating the mammal with 
HGPRBMY4 polypeptide, or a fragment thereof, adequate to produce antibody and/ or 
T cell immune response to protect said animal from infections such as bacterial, 
fungal, protozoan and viral infections, particularly infections caused by HIV-1 or HIV- 
2. Yet another aspect of the invention relates to a method of inducing immunological 
response in a mammal which comprises, delivering HGPRBMY4 polypeptide via a 
vector directing expression of HGPRBMY4 polynucleotide in vivo in order to induce 
such an immunological response to produce antibody to protect said animal from 
diseases. 

[0206] A further aspect of the invention relates to an immunological/ vaccine 

formulation (composition) which, when introduced into a mammalian host, induces an 
immunological response in that mammal to an HGPRBMY4 polypeptide wherein the 
composition comprises a HGPRBMY4 polypeptide or HGPRBMY4 gene. The 
vaccine formulation may further comprise a suitable carrier. Since the HGPRBMY4 
polypeptide may be broken down in the stomach, it is preferably administered 
parenterally (including subcutaneous, intramuscular, intravenous, intradermal, etc. 
injection). Formulations suitable for parenteral administration include aqueous and 
non-aqueous sterile injection solutions which may contain anti-oxidants, buffers, 
bacteriostats and solutes which render the formulation isotonic with the blood of the 
recipient; and aqueous and non-aqueous sterile suspensions which may include 
suspending agents or thickening agents. The formulations may be presented in unit- 
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dose or multi-dose containers, for example, sealed ampoules and vials, and may be 
stored in a freeze-dried condition requiring only the addition of the sterile liquid carrier 
immediately prior to use. The vaccine formulation may also include adjuvant systems 
for enhancing the immunogenicity of the formulation, such as oil-in-water systems and 
other systems known in the art. The dosage will depend on the specific activity of the 
vaccine and can be readily determined by routine experimentation. 
[0207] In an embodiment of the present invention, the polynucleotide encoding 

the HGPRBMY4 polypeptide, or any fragment or complement thereof, may be used 
for therapeutic purposes. In one aspect, antisense, to the polynucleotide encoding the 
HGPRBMY4 polypeptide, may be used in situations in which it would be desirable to 
block the transcription of the mRNA. In particular, cells may be transformed with 
sequences complementary to polynucleotides encoding HGPRBMY4 polypeptide. 
Thus, complementary molecules may be used to modulate HGPRBMY4 
polynucleotide and polypeptide activity, or to achieve regulation of gene function. 
Such technology is now well known in the art, and sense or antisense oligomers or 
oligonucleotides, or larger fragments, can be designed from various locations along the 
coding or control regions of polynucleotide sequences encoding HGPRBMY4 
polypeptide. 

[0208] Expression vectors derived from retroviruses, adenovirus, herpes or 

vaccinia viruses, or from various bacterial plasmids may be used for delivery of 
nucleotide sequences to the targeted organ, tissue or cell population. Methods, which 
are well known to those skilled in the art, can be used to construct recombinant vectors 
which will express a nucleic acid sequence that is complementary to the nucleic acid 
sequence encoding the HGPRBMY4 polypeptide. These techniques are described 
both in J. Sambrook et al., supra and in F.M. Ausubel et al., supra. 
[0209] Polypeptides used in treatment can also be generated endogenously in 

the subject, in treatment modalities often referred to as "gene therapy". Thus for 
example, cells from a subject may be engineered with a polynucleotide, such as DNA 
or RNA, to encode a polypeptide ex vivo, and for example, by the use of a retroviral 
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plasmid vector. The cells can then be introduced into the subject. 
[0210] The genes encoding the HGPRBMY4 polypeptide can be turned off by 

transforming a cell or tissue with an expression vector that expresses high levels of an 
HGPRBMY4 polypeptide-encoding polynucleotide, or a fragment thereof. Such 
constructs may be used to introduce untranslatable sense or antisense sequences into a 
cell. Even in the absence of integration into the DNA, such vectors may continue to 
transcribe RNA molecules until they are disabled by endogenous nucleases. Transient 
expression may last for a month or more with a non-replicating vector, and even longer 
if appropriate replication elements are designed to be part of the vector system. 
[02 1 1 ] Modifications of gene expression can be obtained by designing 

antisense molecules or complementary nucleic acid sequences (DNA, RNA, or PNA), 
to the control, 5', or regulatory regions of the gene encoding the HGPRBMY4 
polypeptide, (e.g., signal sequence, promoters, enhancers, and introns). 
Oligonucleotides derived from the transcription initiation site, e.g., between positions - 
10 and +10 from the start site, are preferred. Similarly, inhibition can be achieved 
using "triple helix" base-pairing methodology. Triple helix pairing is useful because it 
causes inhibition of the ability of the double helix to open sufficiently for the binding 
of polymerases, transcription factors, or regulatory molecules. Recent therapeutic 
advances using triplex DNA have been described (see, for example, J.E. Gee et al., 
1994, In: B.E. Huber and B.I. Carr, Molecular and Immunologic Approaches, Futura 
Publishing Co., Mt. Kisco, NY). The antisense molecule or complementary sequence 
may also be designed to block translation of mRNA by preventing the transcript from 
binding to ribosomes. 

[0212] Ribozymes, i.e., enzymatic RNA molecules, may also be used to 

catalyze the specific cleavage of RNA. The mechanism of ribozyme action involves 
sequence-specific hybridization of the ribozyme molecule to complementary target 
RNA, followed by endonucleolytic cleavage. Suitable examples include engineered 
hammerhead motif ribozyme molecules that can specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding HGPRBMY4 polypeptide. 
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[0213] Specific ribozyme cleavage sites within any potential RNA target are 

initially identified by scanning the target molecule for ribozyme cleavage sites which 
include the following sequences: GUA, GUU, and GUC. Once identified, short RNA 
sequences of between 15 and 20 ribonucleotides corresponding to the region of the 
target gene containing the cleavage site may be evaluated for secondary structural 
features which may render the oligonucleotide inoperable. The suitability of candidate 
targets may also be evaluated by testing accessibility to hybridization with 
complementary oligonucleotides using ribonuclease protection assays. 
[0214] Complementary ribonucleic acid molecules and ribozymes according to 

the invention may be prepared by any method known in the art for the synthesis of 
nucleic acid molecules. Such methods include techniques for chemically synthesizing 
oligonucleotides, for example, solid phase phosphoramidite chemical synthesis. 
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription 
of DNA sequences encoding HGPRBMY4. Such DNA sequences may be 
incorporated into a wide variety of vectors with suitable RNA polymerase promoters 
such as T7 or SP. Alternatively, the cDNA constructs that constitutively or inducibly 
synthesize complementary RNA can be introduced into cell lines, cells, or tissues. 
[0215] RNA molecules may be modified to increase intracellular stability and 

half-life. Possible modifications include, but are not limited to, the addition of 
flanking sequences at the 5' and/ or 3' ends of the molecule, or the use of 
phosphorothioate or 2' O-methyl, rather than phosphodiesterase linkages within the 
backbone of the molecule. This concept is inherent in the production of PNAs and can 
be extended in all of these molecules by the inclusion of nontraditional bases such as 
inosine, queosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly 
modified forms of adenine, cytosine, guanine, thymine, and uridine which are not as 
easily recognized by endogenous endonucleases. 

[0216] Many methods for introducing vectors into cells or tissues are available 

and are equally suitable for use in vivo, in vitro, and ex vivo. For ex vivo therapy, 
vectors may be introduced into stem cells taken from the patient and clonally 
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propagated for autologous transplant back into that same patient. Delivery by 
transfection and by liposome injections may be achieved using methods, which are 
well known in the art. 

[0217] Any of the therapeutic methods described above may be applied to any 

individual in need of such therapy, including, for example, mammals such as dogs, 
cats, cows, horses, rabbits, monkeys, and most preferably, humans. 
[021 8] A further embodiment of the present invention embraces the 

administration of a pharmaceutical composition, in conjunction with a 
pharmaceutically acceptable carrier, diluent, or excipient, for any of the above- 
described therapeutic uses and effects. Such pharmaceutical compositions may 
comprise HGPRBMY4 nucleic acid, polypeptide, or peptides, antibodies to 
HGPRBMY4 polypeptide, mimetics, agonists, antagonists, or inhibitors of 
HGPRBMY4 polypeptide or polynucleotide. The compositions may be administered 
alone or in combination with at least one other agent, such as a stabilizing compound, 
which may be administered in any sterile, biocompatible pharmaceutical carrier, 
including, but not limited to, saline, buffered saline, dextrose, and water. The 
compositions may be administered to a patient alone, or in combination with other 
agents, drugs, hormones, or biological response modifiers. 

[0219] The pharmaceutical compositions for use in the present invention can 

be administered by any number of routes including, but not limited to, oral, 
intravenous, intramuscular, intra- arterial, intramedullary, intrathecal, intraventricular, 
transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, sublingual, 
vaginal, or rectal means. 

[0220] In addition to the active ingredients (i.e., the HGPRBMY4 nucleic acid 

or polypeptide, or functional fragments thereof), the pharmaceutical compositions may 
contain suitable pharmaceutically acceptable carriers or excipients comprising 
auxiliaries which facilitate processing of the active compounds into preparations which 
can be used pharmaceutically. Further details on techniques for formulation and 
administration are provided in the latest edition of Remington 's Pharmaceutical 
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[0221] Pharmaceutical compositions for oral administration can be formulated 

using pharmaceutically acceptable carriers well known in the art in dosages suitable 
for oral administration. Such carriers enable the pharmaceutical compositions to be 
formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, 
suspensions, and the like, for ingestion by the patient. 

[0222] Pharmaceutical preparations for oral use can be obtained by the 

combination of active compounds with solid excipient, optionally grinding a resulting 
mixture, and processing the mixture of granules, after adding suitable auxiliaries, if 
desired, to obtain tablets or dragee cores. Suitable excipients are carbohydrate or 
protein fillers, such as sugars, including lactose, sucrose, mannitol, or sorbitol; starch 
from corn, wheat, rice, potato, or other plants; cellulose, such as methyl cellulose, 
hydroxypropyl-methylcellulose, or sodium carboxymethylcellulose; gums, including 
arabic and tragacanth, and proteins such as gelatin and collagen. If desired, 
disintegrating or solubilizing agents may be added, such as cross-linked polyvinyl 
pyrrolidone, agar, alginic acid, or a physiologically acceptable salt thereof, such as 
sodium alginate. 

[0223] Dragee cores may be used in conjunction with physiologically suitable 

coatings, such as concentrated sugar solutions, which may also contain gum arabic, 
talc, polyvinylpyrrolidone, carbopol gel, polyethylene glycol, and/ or titanium dioxide, 
lacquer solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or 
pigments may be added to the tablets or dragee coatings for product identification, or 
to characterize the quantity of active compound, i.e., dosage. 

[0224] Pharmaceutical preparations, which can be used orally, include push-fit 

capsules made of gelatin, as well as soft, scaled capsules made of gelatin and a 
coating, such as glycerol or sorbitol. Push-fit capsules can contain active ingredients 
mixed with a filler or binders, such as lactose or starches, lubricants, such as talc or 
magnesium stearate, and, optionally, stabilizers. In soft capsules, the active 
compounds may be dissolved or suspended in suitable liquids, such as fatty oils, 
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liquid, or liquid polyethylene glycol with or without stabilizers. 

[0225] Pharmaceutical formulations suitable for parenteral administration may 

be formulated in aqueous solutions, preferably in physiologically compatible buffers 
such as Hanks' solution, Ringer's solution, or physiologically buffered saline. 
Aqueous injection suspensions may contain substances, which increase the viscosity of 
the suspension, such as sodium carboxymethyl cellulose, sorbitol, or dextran. In 
addition, suspensions of the active compounds may be prepared as appropriate oily 
injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils such as 
sesame oil, or synthetic fatty acid esters, such as ethyloleate or triglycerides, or 
liposomes. Optionally, the suspension may also contain suitable stabilizers or agents 
which increase the solubility of the compounds to allow for the preparation of highly 
concentrated solutions. 

[0226] For topical or nasal administration, penetrants or permeation agents that 

are appropriate to the particular barrier to be permeated are used in the formulation. 
Such penetrants are generally known in the art. 

[0227] The pharmaceutical compositions of the present invention may be 

manufactured in a manner that is known in the art, e.g., by means of conventional 
mixing, dissolving, granulating, dragee-making, levigating, emulsifying, 
encapsulating, entrapping, or lyophilizing processes. 

[0228] The pharmaceutical composition may be provided as a salt and can be 

formed with many acids, including but not limited to, hydrochloric, sulfuric, acetic, 
lactic, tartaric, malic, succinic, and the like. Salts tend to be more soluble in aqueous 
solvents, or other protonic solvents, than are the corresponding free base forms. In 
other cases, the preferred preparation may be a lyophilized powder which may contain 
any or all of the following: 1-50 mM histidine, 0.1%-2% sucrose, and 2-7% mannitol, 
at a pH range of 4.5 to 5.5, combined with a buffer prior to use. After the 
pharmaceutical compositions have been prepared, they can be placed in an appropriate 
container and labeled for treatment of an indicated condition. For administration of 
HGPRBMY4 product, such labeling would include amount, frequency, and method of 
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[0229] Pharmaceutical compositions suitable for use in the present invention 

include compositions wherein the active ingredients are contained in an effective 
amount to achieve the intended purpose. The determination of an effective dose or 
amount is well within the capability of those skilled in the art. For any compound, the 
therapeutically effective dose can be estimated initially either in cell culture assays, 
e.g., using neoplastic cells, or in animal models, usually mice, rabbits, dogs, or pigs. 
The animal model may also be used to determine the appropriate concentration range 
and route of administration. Such information can then be used and extrapolated to 
determine useful doses and routes for administration in humans. 
[0230] A therapeutically effective dose refers to that amount of active 

ingredient, for example, HGPRBMY4 polypeptide, or fragments thereof, antibodies to 
HGPRBMY4 polypeptide, agonists, antagonists or inhibitors of HGPRBMY4 
polypeptide, which ameliorates, reduces, or eliminates the symptoms or condition. 
Therapeutic efficacy and toxicity may be determined by standard pharmaceutical 
procedures in cell cultures or experimental animals, e.g., ED 50 (the dose 
therapeutically effective in 50% of the population) and LD 50 (the dose lethal to 50% of 
the population). The dose ratio of toxic to therapeutic effects is the therapeutic index, 
which can be expressed as the ratio, ED 5 o/LD 50 . Pharmaceutical compositions which 
exhibit large therapeutic indices are preferred. The data obtained from cell culture 
assays and animal studies are used in determining a range of dosages for human use. 
Preferred dosage contained in a pharmaceutical composition is within a range of 
circulating concentrations that include the ED 50 with little or no toxicity. The dosage 
varies within this range depending upon the dosage form employed, sensitivity of the 
patient, and the route of administration. 

[0231] The practitioner, who will consider the factors related to the individual 

requiring treatment, will determine the exact dosage. Dosage and administration are 
adjusted to provide sufficient levels of the active moiety or to maintain the desired 
effect. Factors, which may be taken into account, include the severity of the 
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individual's disease state, general health of the patient, age, weight, and gender of the 
patient, diet, time and frequency of administration, drug combination(s), reaction 
sensitivities, and tolerance/ response to therapy. As a general guide, long-acting 
pharmaceutical compositions may be administered every 3 to 4 days, every week, or 
once every two weeks, depending on half-life and clearance rate of the particular 
formulation. Variations in these dosage levels can be adjusted using standard 
empirical routines for optimization, as is well understood in the art. 
[0232] Normal dosage amounts may vary from 0.1 to 100,000 micrograms 

(ug), up to a total dose of about 1 gram (g), depending upon the route of 
administration. Guidance as to particular dosages and methods of delivery is provided 
in the literature and is generally available to practitioners in the art. Those skilled in 
the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to 
particular cells, conditions, locations, and the like. 

[0233] In another embodiment of the present invention, antibodies which 

specifically bind to the HGPRBMY4 polypeptide may be used for the diagnosis of 
conditions or diseases characterized by expression (or overexpression) of the 
HGPRBMY4 polynucleotide or polypeptide, or in assays to monitor patients being 
treated with the HGPRBMY4 polypeptide, or its agonists, antagonists, or inhibitors. 
The antibodies useful for diagnostic purposes may be prepared in the same manner as 
those described herein for use in therapeutic methods. Diagnostic assays for the 
HGPRBMY4 polypeptide include methods which utilize the antibody and a label to 
detect the protein in human body fluids or extracts of cells or tissues. The antibodies 
may be used with or without modification, and may be labeled by joining them, either 
covalently or non-covalently, with a reporter molecule. A wide variety of reporter 
molecules, which are known in the art, may be used, several of which are described 
above. 

[0234] The use of mammalian cell reporter assays to demonstrate functional 

coupling of known GPCRs (G Protein Coupled Receptors) has been well documented 

EXPRESS MAIL LABEL NO.: EL 912004928 US 

646785 vl 



-70- 



in the literature (Gilman, 1987, Boss et al., 1996; Alam & Cook, 1990; George et al., 
1997; Selbie & Hill, 1998; Rees et al., 1999). In fact, reporter assays have been 
successfully used for identifying novel small molecule agonists or antagonists against 
GPCRs as a class of drug targets (Zlokarnik et al., 1998; George et al., 1997; Boss et 
al., 1996; Rees et al, 2001). In such reporter assays, a promoter is regulated as a direct 
consequence of activation of specific signal transduction cascades following agonist 
binding to a GPCR (Alam & Cook 1990; Selbie & Hill, 1998; Boss et al., 1996; 
George et al., 1997; Gilman, 1987). 

[0235] A number of response element-based reporter systems have been 

developed that enable the study of GPCR function. These include cAMP response 
element (CRE)-based reporter genes for G alpha i/o, G alpha s- coupled GPCRs, 
Nuclear Factor Activator of Transcription (NFAT)-based reporters for G alpha q/1 lor 
the promiscuous G protein G alpha 15/16 -coupled receptors and MAP kinase reporter 
genes for use in Galpha i/o coupled receptors (Selbie & Hill, 1998; Boss et al, 1996; 
George et al., 1997; Blahos, et al., 2001; Offermann & Simon, 1995; Gilman, 1987; 
Rees et al., 2001). Transcriptional response elements that regulate the expression of 
Beta-Lactamase within a CHO Kl cell line (CHO/NFAT-CRE: Aurora Biosciences 
™) (Zlokarnik et al., 1998) have been implemented to characterize the function of the 
orphan HGPRBMY4 polypeptide of the present invention. The system enables 
demonstration of constitutive G-protein coupling to endogenous cellular signaling 
components upon intracellular overexpression of orphan receptors. Overexpression has 
been shown to represent a physiologically relevant event. For example, it has been 
shown that overexpression occurs in nature during metastatic carcinomas, wherein 
defective expression of the monocyte chemotactic protein 1 receptor, CCR2, in 
macrophages is associated with the incidence of human ovarian carcinoma (Sica, et 
al.,2000; Salcedo et al., 2000). Indeed, it has been shown that overproduction of the 
Beta 2 Adrenergic Receptor in transgenic mice leads to constitutive activation of the 
receptor signaling pathway such that these mice exhibit increased cardiac output 
(Kypson et al., 1999; Dorn et al., 1999). These are only a few of the many examples 
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demonstrating constitutive activation of GPCRs whereby many of these receptors are 
likely to be in the active, R*, conformation (J.Wess 1997) (see Example 7). 
[0236] Several assay protocols including ELISA, RIA, and FACS for 

measuring the HGPRBMY4 polypeptide are known in the art and provide a basis for 
diagnosing altered or abnormal levels of HGPRBMY4 polypeptide expression. 
Normal or standard values for HGPRBMY4 polypeptide expression are established by 
combining body fluids or cell extracts taken from normal mammalian subjects, 
preferably human, with antibody to the HGPRBMY4 polypeptide under conditions 
suitable for complex formation. The amount of standard complex formation may be 
quantified by various methods; photometric means are preferred. Quantities of 
HGPRBMY4 polypeptide expressed in subject sample, control sample, and disease 
samples from biopsied tissues are compared with the standard values. Deviation 
between standard and subject values establishes the parameters for diagnosing disease. 
Microarrays and Screening Assays 

[0237] In another embodiment of the present invention, oligonucleotides, or 

longer fragments derived from the HGPRBMY4 polynucleotide sequence described 
herein may be used as targets in a microarray. The microarray can be used to monitor 
the expression level of large numbers of genes simultaneously (to produce a transcript 
image), and to identify genetic variants, mutations and polymorphisms. This 
information may be used to determine gene function, to understand the genetic basis of 
a disease, to diagnose disease, and to develop and monitor the activities of therapeutic 
agents. In a particular aspect, the microarray is prepared and used according to the 
methods described in WO 95/1 1995 (Chee et al.); D.J. Lockhart et al., 1996, Nature 
Biotechnology, 14:1675-1680; and M. Schena et al., 1996, Proc. Natl. Acad. Set USA, 
93:10614-10619). Microarrays are further described in U.S. Patent No. 6,015,702 to 
P. Lai et al. 

[0238] In another embodiment of this invention, the nucleic acid sequence, 

which encodes the HGPRBMY4 polypeptide, may also be used to generate 
hybridization probes, which are useful for mapping the naturally occurring genomic 
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sequence. The sequences may be mapped to a particular chromosome, to a specific 
region of a chromosome, or to artificial chromosome constructions (HACs), yeast 
artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI 
constructions, or single chromosome cDNA libraries, as reviewed by CM. Price, 
1993, Blood Rev., 7:127-134 and by B.J. Trask, 1991, Trends Genet, 7:149-154. 
[0239] Fluorescent In Situ Hybridization (FISH), (as described in I. Verma et 

al., 1988, Human Chromosomes: A Manual of Basic Techniques Pergamon Press, 
New York, NY) may be correlated with other physical chromosome mapping 
techniques and genetic map data. Examples of genetic map data can be found in 
numerous scientific journals or at Online Mendelian Inheritance in Man (OMIM). 
Correlation between the location of the gene encoding the HGPRBMY4 polypeptide 
on a physical chromosomal map and a specific disease, or predisposition to a specific 
disease, may help delimit the region of DNA associated with that genetic disease. The 
nucleotide sequences, particularly that of SEQ ID NO:l, or fragments thereof, 
according to this invention may be used to detect differences in gene sequences 
between normal, carrier, or affected individuals. 

[0240] In situ hybridization of chromosomal preparations and physical 

mapping techniques such as linkage analysis using established chromosomal markers 
may be used for extending genetic maps. Often the placement of a gene on the 
chromosome of another mammalian species, such as mouse, may reveal associated 
markers, even if the number or arm of a particular human chromosome is not known. 
New sequences can be assigned to chromosomal arms, or parts thereof, by physical 
mapping. This provides valuable information to investigators searching for disease 
genes using positional cloning or other gene discovery techniques. Once the disease or 
syndrome has been crudely localized by genetic linkage to a particular genomic region, 
for example, AT to llq22-23 (R.A. Gatti et al., 1988, Nature, 336:577-580), any 
sequences mapping to that area may represent associated or regulatory genes for 
further investigation. The nucleotide sequence of the present invention may also be 
used to detect differences in the chromosomal location due to translocation, inversion, 
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and the like, among normal, carrier, or affected individuals. 
[0241] In another embodiment of the present invention, the HGPRBMY4 

polypeptide, its catalytic or immunogenic fragments or oligopeptides thereof, can be 
used for screening libraries of compounds in any of a variety of drug screening 
techniques. The fragment employed in such screening may be free in solution, affixed 
to a solid support, borne on a cell surface, or located intracellularly. The formation of 
binding complexes, between HGPRBMY4 polypeptide, or portion thereof, and the 
agent being tested, may be measured utilizing techniques commonly practiced in the 
art. 

[0242] Another technique for drug screening which may be used provides for 

high throughput screening of compounds having suitable binding affinity to the protein 
of interest as described in WO 84/03564 (Venton, et al.). In this method, as applied to 
the HGPRBMY4 protein, large numbers of different small test compounds are 
synthesized on a solid substrate, such as plastic pins or some other surface. The test 
compounds are reacted with the HGPRBMY4 polypeptide, or fragments thereof, and 
washed. Bound HGPRBMY4 polypeptide is then detected by methods well known in 
the art. Purified HGPRBMY4 polypeptide can also be coated directly onto plates for 
use in the aforementioned drug screening techniques. Alternatively, non-neutralizing 
antibodies can be used to capture the peptide and immobilize it on a solid support. 
[0243] In a further embodiment of this invention, competitive drug screening 

assays can be used in which neutralizing antibodies, capable of binding the 
HGPRBMY4 polypeptide, specifically compete with a test compound for binding to 
the HGPRBMY4 polypeptide. In this manner, the antibodies can be used to detect the 
presence of any peptide, which shares one or more antigenic determinants with the 
HGPRBMY4 polypeptide. 

EXAMPLES 

[0244] The Examples herein are meant to exemplify the various aspects of 

carrying out the invention and are not intended to limit the scope of the invention in 
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any way. The Examples do not include detailed descriptions for conventional methods 
employed, such as in the construction of vectors, the insertion of cDNA into such 
vectors, or the introduction of the resulting vectors into the appropriate host. Such 
methods are well known to those skilled in the art and are described in numerous 
publication's, for example, Sambrook, Fritsch, and Maniatis, Molecular Cloning: a 
Laboratory Manual . 2 nd Edition, Cold Spring Harbor Laboratory Press, USA, (1989). 

EXAMPLE 1 
BIOINFORMATICS ANALYSIS 

[0245] G-protein coupled receptor sequences were used as a probe to search 

the Incyte and public domain EST databases. The search program used was gapped 

BLAST (S.F. Altschul, et al., Nuc. Acids Res .. 25:3389-4302 (1997)). The top EST 

hits from the BLAST results were searched back against the non-redundant protein and 

patent sequence databases. From this analysis, ESTs encoding potential novel GPCRs 

were identified based on sequence homology. The Incyte EST (CloneID:998550) was 

selected as potential novel GPCR candidate, called HGPRBMY4, for subsequent 

analysis. This EST was sequenced and the full-length clone of this GPCR was 

obtained using the EST sequence information and conventional methods. The 

complete protein sequence of HGPRBMY4 was analyzed for potential transmembrane 

domains. The TMPRED program (K. Hofmann and W. Stoffel, Biol. Chem., 347:166 

(1993)) was used for transmembrane prediction. The program predicted seven 

transmembrane domains and the predicted domains match with the predicated 

transmembrane domains of related GPCRs at the sequence level. Based on sequence, 

structure and known GPCR signature sequences, the orphan protein, HGPRBMY4, is a 

novel human GPCR. 

EXAMPLE 2 

CLONING OF THE NOVEL HUMAN GPCR HGPRBMY4 
[0246] Using the EST sequence, an antisense 80 base pair oligonucleotide with 

biotin on the 5' end was designed that was complementary to the putative coding 
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region of HGPRBMY4 as follows: 5'-b-GAT CCA CCA TCA TGA AGA AGC TGA 
ACT GTG ACC AGC ACC AGG CAG GTA GAG GCT CAA CCG TAT GGA AGG 
AAT GTG TGA CC-3' (SEQ ID NO:5). This biotinylated oligo was incubated with a 
mixture of single-stranded covalently closed circular cDNA libraries, which contained 
DNA corresponding to the sense strand. Hybrids between the biotinylated oligo and 
the circular cDNA were captured on streptavidin magnetic beads. Upon thermal 
release of the cDNA from the biotinylated oligo, the single stranded cDNA was 
converted into double strands using a primer homologous to a sequence on the cDNA 
cloning vector. The double stranded cDNA was introduced into E. coli by 
electroporation and the resulting colonies were screened by PCR, using a primer pair 
designed from the EST sequence to identify the proper cDNA. 
[0247] Oligos used to identify the cDNA by PCR were as follows: 

HGPRBMY4s (SEQ ID NO:6) 5'-ACTGAGCAC AGCCTGCAT GA-3 and 
HGPRBMY4a (SEQ ID NO:7) 5'-b-TCTGTAGCA GACAAGCAT CAAACTG -3' 
[0248] Those cDNA clones that were positive by PCR had the inserts sized and 

two of the largest clones (4.5 Kb and 3.3 Kb) were chosen for DNA sequencing. Both 
clones had identical sequence over the common regions. 

EXAMPLE 3 

EXPRESSION PROFILING OF NOVEL HUMAN GPCR, HGPRBMY4 
[0249] The same PCR primer pair used to identify HGPRBMY4 cDNA clones 

(HGPRBMY4s-SEQ ID NO:6 and HGPRBMY4a- SEQ ID NO:7) was used to 
measure the steady state levels of mRNA by quantitative PCR. Briefly, first strand 
cDNA was made from commercially available mRNA. The relative amount of cDNA 
used in each assay was determined by performing a parallel experiment using a primer 
pair for the cyclophilin gene, which is expressed in equal amounts in all tissues. The 
cyclophilin primer pair detected small variations in the amount of cDNA in each 
sample, and these data were used for normalization of the data obtained with the 
primer pair for HGPRBMY4. The PCR data were converted into a relative assessment 
of the difference in transcript abundance among the tissues tested and the data are 
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presented in Figure 7. Transcripts corresponding to the orphan GPCR, HGPRBMY4, 
were found to be highly expressed in prostate and moderately in heart. 

EXAMPLE 4 

G-PROTEIN COUPLED RECEPTOR PCR EXPRESSION PROFILING 
[0250] RNA quantification was performed using the Taqman® real-time-PCR 

fluorogenic assay. The Taqman® assay is one of the most precise methods for 
assaying the concentration of nucleic acid templates. 

[0251] All cell lines were grown using standard conditions: RPMI 1640 

supplemented with 10% fetal bovine serum, 100 IU/ml penicillin, 100 mg/ml 
streptomycin, and 2 mM L-glutamine, 10 mM Hepes (all from GibcoBRL; Rockville, 
MD). Eighty percent confluent cells were washed twice with phosphate-buffered saline 
(GibcoBRL) and harvested using 0.25% trypsin (GibcoBRL). RNA was prepared 
using the RNeasy Maxi Kit from Qiagen (V alencia, CA). 
[0252] cDNA template for real-time PCR was generated using the 

Superscript™ First Strand Synthesis system for RT-PCR. 

[0253] SYBR Green real-time PCR reactions were prepared as follows: The 

reaction mix consisted of 20 ng first strand cDNA; 50 nM Forward Primer; 50 nM 
Reverse Primer; 0.75X SYBR Green I (Sigma); IX SYBR Green PCR Buffer 
(50mMTris-HCl pH8.3, 75mM KC1); 10%DMSO; 3mM MgCl 2 ; 300 uM each dATP, 
dGTP, dTTP, dCTP; 1 U Platinum® Taq DNA Polymerase High Fidelity (Cat# 1 1304- 
029; Life Technologies; Rockville, MD); 1 :50 dilution; ROX (Life Technologies). 
Real-time PCR was performed using an Applied Biosystems 5700 Sequence Detection 
System. Conditions were 95°C for 10 min (denaturation and activation of Platinum® 
Taq DNA Polymerase), 40 cycles of PCR (95 °C for 15 sec, 60°C for 1 min). PCR 
products are analyzed for uniform melting using an analysis algorithm built into the 
5700 Sequence Detection System. 

Forward primer: GPCR9-F1: 5'-CCTGTGCTCAACCCAATTGTCT-3' (SEQ ID 
NO:25); and 
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Reverse primer: GPCR9-R1: 5 '-ACTGACACCTAGGGCTCTGAAG-3 ' (SEQ ID 
NO:26). 

[0254] cDNA quantification used in the normalization of template quantity was 

performed using Taqman® technology. Taqman® reactions are prepared as follows: 
The reaction mix consisted of 20 ng first strand cDNA; 25 nM GAPDH-F3, Forward 
Primer; 250 nM GAPDH-R1 Reverse Primer; 200 nM GAPDH-PVIC Taqman® Probe 
(fluorescent dye labeled oligonucleotide primer); IX Buffer A (Applied Biosystems); 
5.5 mM MgC12; 300 uM dATP, dGTP, dTTP, dCTP; 1 U Amplitaq Gold (Applied 
Biosystems). GAPDH, D-glyceraldehyde -3-phosphate dehydrogenase, was used as 
control to normalize mRNA levels. 

[0255] Real-time PCR was performed using an Applied Biosystems 7700 

Sequence Detection System. Conditions were 95°C for 10 min. (denaturation and 
activation of Amplitaq Gold), 40 cycles of PCR (95°C for 15 sec, 60°C for 1 min). 
[0100] The sequences for the GAPDH oligonucleotides used in the Taqman® 

reactions are as follows: 

GAPDH-F3 -5'-AGCCGAGCCACATCGCT-3' (SEQ ID NO:27) 
GAPDH-R1 -5'-GTGACCAGGCGCCCAATAC-3' (SEQIDNO:28) 
GAPDH-PVIC Taqman® Probe -VIC-5'- 

CAAATCCGTTGACTCCGACCTTCACCTT-3 ' TAMRA (SEQ ID NO:29). 

[0256] The Sequence Detection System generates a Ct (threshold cycle) value 

that is used to calculate a concentration for each input cDNA template. cDNA levels 
for each gene of interest are normalized to GAPDH cDNA levels to compensate for 
variations in total cDNA quantity in the input sample. This is done by generating 
GAPDH Ct values for each cell line. Ct values for the gene of interest and GAPDH 
are inserted into a modified version of the 5SCt equation (Applied Biosystems Prism® 
7700 Sequence Detection System User Bulletin #2), which is used to calculate a 
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GAPDH normalized relative cDNA level for each specific cDNA. The 58Ct equation 
is as follows: relative quantity of nucleic acid template =2 55Ct = 2 (SCta - SCtb) , where 
5Cta=Ct target - Ct GAPDH, and 5Ctb = Ct reference - Ct GAPDH. (No reference 
cell line was used for the calculation of relative quantity; 8Ctb was defined as 21). 
[0257] The Graph # of Table 1 corresponds to the tissue type position number 

of Figure 8. Interestingly, HGPRBMY4 (also known as GPCR9) was found to be 
overexpressed 800 to 49,000 fold greater in colon carcinoma cell lines and 150,000 in 
the SHP-77 lung carcinoma cell line, in comparison to other cancer cell lines in the 
OCLP-1 (oncology cell line panel). 
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TABLE 1 



Graph # 


Name 


Tissue 


CtGAPDH 


GPCR9-1 


dCt 


ddCt 


Quant. 


l 


A-427 


lung 


18 


40 


22 


I 


5.0E-01 


2 


A431 




19.85 


36.19 


16.34 


-4.66 


2.5E+01 


3 


A2780/DDP-S 


ovarian 


17.89 


33.7 


15.81 


-5.19 


3.7E+01 


4 


A2780/DDP-R 


ovarian 


21.51 


40 


18.49 


-2.51 


5.7E+00 


5 


HCT116/epo5 




17.71 


40 


22.29 


1.29 


4.1E-01 


6 


A2780/TAX-R 


ovarian 


18.4 


37.62 


19.22 


-1.78 


3.4E+00 


7 


A2780/TAX-S 


ovarian 


17.83 


40 


22.17 


1.17 


4.4E-01 


8 


A549 




17.63 


32.77 


15.14 


-5.86 


5.8E+01 


9 


AIN4/myc 




17.81 


40 


22.19 


1.19 


4.4E-01 


10 


AIN4T 


breast 


17 15 


37 06 


19 91 


-1.09 


2.1E+00 


11 


AIN4 


breast 


17.49 


40 


22.51 


1.51 


3.5E-01 


12 


BT-549 




17.55 


40 


22.45 


1.45 


3.7E-01 


13 


BT-20 


breast 


17.9 


40 


22.1 


1.1 


4.7E-01 


14 


C-33A 


cervical 


17.49 


40 


22.51 


1.51 


3.5E-01 


15 


CACO-2 


colon 


17.56 


37.61 


20.05 


-0.95 


1.9E+00 


16 


Calu-3 


lung 


18.09 


40 


21.91 


0.91 


5.3E-01 


17 


Calu-6 


lung 


16.62 


40 


23.38 


2.38 


1.9E-01 


18 


BT-474 


breast 


17.65 


35.54 


17.89 


-3.11 


8.6E+00 


19 


Cx-1 


colon 


18.79 


40 


21.21 


0.21 


8.6E-01 
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Graph # 


Name 


Tissue 


CtGAPDH 


GPCR9-1 


dCt 


ddCt 


Quant. 


20 


CCRF-CEM 


leukemia 


17.07 


38.51 


21.44 


0.44 


7.4E-01 


21 


ChaGo-K-1 


lung 


17.79 


40 


22.21 


1.21 


4.3E-01 




DU4475 


breast 


18.1 


40 


21.9 


0.9 


5.4E-01 


23 




ovarian 


17.22 


36.83 


19.61 


-1.39 


2.6E+00 


24 


H3396 


breast 


18.04 


40 


21.96 


0.96 


5.1E-01 


25 


HBL100 


breast 


17.02 


34.52 


17.5 


-3.5 


1.1E+01 


26 


HCT1 16/VM4 
6 


colon 


17.87 


35.35 


17.48 


-3.52 


1.1E+01 


27 


HCT116/VP35 


colon 


17.3 


40 


22.7 


1.7 


3.1E-01 


28 


HCT116 


colon 


17.59 


35.57 


17.98 


-3.02 


8.1E+00 


29 


A2780/epo5 


ovarian 


17.54 


34.65 


17.11 


-3.89 


1.5E+01 


30 


HCT116/ras 


colon 


17.18 


40 


22.82 


1.82 


2.8E-01 


31 


HCT116/TX15 
CR 


colon 


17.36 


36.41 


19.05 


-1.95 


3.9E+00 


32 


HT-29 


colon 


17.9 


29.26 


11.36 


-9.64 


8.0E+02 


33 


HeLa 


cervical 


17.59 


35.15 


17.56 


-3.44 


1.1E+01 


34 


Her2 MCF-7 


breast 


19.26 


40 


20.74 


-0.26 


1.2E+00 


35 


HL-60 


leukemia 


17.54 


35.64 


18.1 


-2.9 


7.5E+00 


36 


HOC-76 


ovarian 


34.3 


40 


5.7 


-15.3 


Mouse 


37 


Hs 294T 


melanoma 


17.73 


40 


22.27 


1.27 


4.1E-01 


38 


HS 578T 


breast 


17.83 


34.93 


17.1 


-3.9 


1.5E+01 


39 


HT-1080 


fibrosarco 
ma 


17.16 


36.92 


19.76 


-1.24 


2.4E+00 
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Graph # 


Name 


Tissue 


CtGAPDH 


GPCR9-1 


dCt 


ddCt 


Quant. 


40 


HCT116/vivo 


colon 


17.7 


34.61 


16.91 


-4.09 


1.7E+01 


41 


HT-3 


cervical 


17.42 


40 


22.58 


1.58 


3.3E-01 


42 


K562 


leukemia 


18.42 


34.32 


15.9 


-5.1 


3.4E+01 


43 


SiHa 


cervical 


18.07 


40 


21.93 


0.93 


5.2E-01 


44 


LNCAP 


prostate 


18.17 


24.67 


6.5 


-14.5 


2.3E+04 


45 


LS 174T 


colon 


17.93 


23.35 


5.42 


-15.58 


4.9E+04 


46 


LX-1 


lung 


18.17 


34.32 


16.15 


-4.85 


2.9E+01 


47 


MCF7 


breast 


17.83 


40 


22.17 


1.17 


4.4E-01 


48 


MCF-7/AdrR 


breast 


17.23 


40 


22.77 


1.77 


2.9E-01 


49 


MDA-MB- 
175-VII 


breast 


15.72 


40 


24.28 


3.28 


1.0E-01 


50 


MDA-MB-231 


breast 


17.62 


40 


22.38 


1.38 


3.8E-01 


51 


MDA-MB-453 


breast 


17.9 


37.1 


19.2 


-1.8 


3.5E+00 


52 


MDA-MB-468 


breast 


17.49 


40 


22.51 


1.51 


3.5E-01 


53 


MDAH 2774 


breast 


16.87 


35.7 


18.83 


-2.17 


4.5E+00 


54 


ME-180 


cervical 


16.86 


40 


23.14 


2.14 


2.3E-01 


55 


MIP 


colon 


16.92 


30.42 


13.5 


-7.5 


1.8E+02 


56 


ddH20 


colon 


40 


36.21 


-3.79 


-24.79 


ND 


57 


SK-CO-1 


colon 


17.75 


40 


22.25 


1.25 


4.2E-01 


58 


LoVo 


colon 


17.64 


36.89 


19.25 


-1.75 


3.4E+00 


59 


SHP-77 


lung 


18.66 


22.42 


3.76 


-17.24 


1.5E+05 


60 


T84 


colon 


16.44 


29.81 


13.37 


-7.63 


2.0E+02 
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Graph # 


Name 


Tissue 


CtGAPDH 


GPCR9-1 


dCt 


ddCt 


Quant. 


61 


BT-483 


breast 


17.45 


40 


22.55 


1.55 


3.4E-01 


62 


CCD-I8C0 


colon, 
fibroblast 


17.19 


34.51 


17.32 


-3.68 


1.3E+01 


63 


Colo 320DM 


colon 


17.01 


32.24 


15.23 


-5.77 


5.5E+01 


64 


DMS 114 


lung 


18.14 


36.92 


18.78 


-2.22 


4.7E+00 


65 


Sk-LU-1 


lung 


15.81 


32.95 


17.14 


-3.86 


1.5E+01 


66 


SK-MES-1 


lung 


17.1 


40 


22.9 


1.9 


2.7E-01 


67 


SW1573 


lung 


17.14 


37.94 


20.8 


-0.2 


1.1E+00 


68 


SW 626 


ovarian 


16.94 


40 


23.06 


2.06 


2.4E-01 


69 


SW1271 


lung 


16.45 


40 


23.55 


2.55 


1.7E-01 


70 


SW756 


cervical 


15.59 


40 


24.41 


3.41 


9.4E-02 


71 


SW900 


lung 


18.17 


40 


21.83 


0.83 


5.6E-01 


72 


T47D 


breast 


18.86 


40 


21.14 


0.14 


9.1E-01 


73 


UACC-812 


breast 


17.06 


40 


22.94 


1.94 


2.6E-01 


74 


UPN251 


ovarian 


17.69 


40 


22.31 


1.31 


4.0E-01 


75 


ZR-75-1 


breast 


15.95 


40 


24.05 


3.05 


1.2E-01 


76 


SKBR3 


breast 


17.12 


40 


22.88 


1.88 


2.7E-01 


77 


SW403 


colon 


18.39 


29.19 


10.8 


-10.2 


1.2E+03 


78 


SW837 


colon 


18.35 


34.65 


16.3 


-4.7 


2.6E+01 


79 


CCD-112Co 


colon 


18.03 


34.95 


16.92 


-4.08 


1.7E+01 


80 


Colo201 


colon 


17.89 


40 


22.11 


1.11 


4.6E-01 
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Graph # 


Name 


Tissue 


CtGAPDH 


GPCR9-1 


dCt 


ddCt 


Quant. 


81 


PC-3 


prostate 


17.25 


40 


22.75 


1.75 


3.0E-01 


82 


OVCAR-3 


ovarian 


17.09 


40 


22.91 


1.91 


2.7E-01 


83 


SW480 


colon 


17 


32.1 


15.1 


-5.9 


6.0E+01 


84 


SW620 


colon 


17.16 


34.74 


17.58 


-3.42 


1.1E+01 


85 


SW1417 


colon 


17.22 


40 


22.78 


1.78 


2.9E-01 


86 


Colo 205 


colon 


18.02 


40 


21.98 


0.98 


5.1E-01 


87 


HCT-8 


colon 


17.44 


35.76 


18.32 


-2.68 


6.4E+00 


88 


PA-1 


ovarian 


17.33 


40 


22.67 


1.67 


3.1E-01 


89 


CCD-33Co 


colon 


17.07 


35.25 


18.18 


-2.82 


7.1E+00 


90 


MRC-5 


lung 


17.3 


40 


22.7 


1.7 


3.1E-01 


91 


Pat-21 R60 


breast 


35.59 


40 


4.41 


-16.59 


ND 


92 


NCI-H596 


lung 


17.73 


37.25 


19.52 


-1.48 


2.8E+00 


93 


MSTO-21 1H 


lung 


16.81 


36.57 


19.76 


-1.24 


2.4E+00 


94 


Caov-3 


ovarian 


15.5 


40 


24.5 


3.5 


8.8E-02 


95 


Ca Ski 


cervical 


17.38 


40 


22.62 


1.62 


3.3E-01 


96 


LSI 23 


colon 


17.65 


34.51 


16.86 


-4.14 


1.8E+01 



EXAMPLE 5 
SIGNAL TRANSDUCTION ASSAYS 



[0258] The activity of GPCRs or homologues thereof, can be measured using 

any assay suitable for the measurement of the activity of a G protein-coupled receptor, 
as commonly known in the art. Signal transduction activity of a G protein-coupled 
receptor can be monitor by monitoring intracellular Ca 2+ , cAMP, inositol 1,4,5- 
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triphosphate (IP 3 ), or 1 ,2-diacylglycerol (DAG). Assays for the measurement of 
intracellular Ca 2+ are described in Sakurai et al. (EP 480 381). Intracellular IP 3 can be 
measured using a kit available from Amersham, Inc. (Arlington Heights, IL). A kit for 
measuring intracellular cAMP is available from Diagnostic Products, Inc. (Los 
Angeles, CA). 

[0259] Activation of a G protein-coupled receptor triggers the release of Ca 2+ 

ions sequestered in the mitochondria, endoplasmic reticulum, and other cytoplasmic 
vesicles into the cytoplasm. Fluorescent dyes, e.g., fura-2, can be used to measure the 
concentration of free cytoplasmic Ca 2+ . The ester of fura-2, which is lipophilic and 
can diffuse across the cell membrane, is added to the media of the host cells expressing 
GPCRs. Once inside the cell, the fura-2 ester is hydrolyzed by cytosolic esterases to 
its non-lipophilic form, and then the dye cannot diffuse back out of the cell. The non- 
lipophilic form of fura-2 will fluoresce when it binds to free Ca 2+ . The fluorescence 
can be measured without lysing the cells at an excitation spectrum of 340 nm or 380 
nm and at fluorescence spectrum of 500 nm (Sakurai et al., EP 480 381). 
[0260] Upon activation of a G protein-coupled receptor, the rise of free 

cytosolic Ca 2+ concentrations is preceded by the hydrolysis of phosphatidylinositol 
4,5-bisphosphate. Hydrolysis of this phospholipid by the phospholipase C yields 1,2- 
diacylglycerol (DAG), which remains in the membrane, and water-soluble inositol 
1,4,5 -triphosphate (IP 3 ). Binding of ligands or agonists will increase the concentration 
of DAG and IP 3 . Thus, signal transduction activity can be measured by monitoring the 
concentration of these hydrolysis products. 

[0261] To measure the IP 3 concentrations, radioactivity labeled 3 H-inositol is 

added to the media of host cells expressing GPCRs. The 3 H-inositol is taken up by the 
cells and incorporated into IP 3 . The resulting inositol triphosphate is separated from 
the mono and di-phosphate forms and measured (Sakurai et al., EP 480 381). 
Alternatively, Amersham provides an inositoll,4,5-triphosphate assay system. With 
this system Amersham provides tritylated inositol 1,4,5-triphosphate and a receptor 
capable of distinguishing the radioactive inositol from other inositol phosphates. With 
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these reagents an effective and accurate competition assay can be performed to 
determine the inositol triphosphate levels. 

[0262] Cyclic AMP levels can be measured according to the methods 

described in Gilman et al., Proc. Natl. Acad. Sci. 67:305-312 (1970). In addition, a kit 
for assaying levels of cAMP is available from Diagnostic Products Corp. (Los 
Angeles, CA). 

EXAMPLE 6 
GPCR ACTIVITY 

[0263] Another method for screening compounds which are antagonists, and 

thus inhibit activation of the receptor polypeptide of the present invention is provided. 
This involves determining inhibition of binding of labeled ligand, such as dATP, 
dAMP, or UTP, to cells which have the receptor on the surface thereof, or cell 
membranes containing the receptor. Such a method further involves transfecting a 
eukaryotic cell with DNA encoding the GPCR polypeptide such that the cell expresses 
the receptor on its surface. The cell is then contacted with a potential antagonist in the 
presence of a labeled form of a ligand, such as dATP, dAMP, or UTP. The ligand can 
be labeled, e.g., by radioactivity, fluorescence, or any detectable label commonly 
known in the art. The amount of labeled ligand bound to the receptors is measured, 
e.g., by measuring radioactivity associated with transfected cells or membrane from 
these cells. If the compound binds to the receptor, the binding of labeled ligand to the 
receptor is inhibited as determined by a reduction of labeled ligand which binds to the 
receptors. This method is called a binding assay. Naturally, this same technique can 
be used to determine agonists. 

[0264] In a further screening procedure, mammalian cells, for example, but not 

limited to, CHO, HEK 293, Xenopus Oocytes, RBL-2H3, etc., which are transfected, 
are used to express the receptor of interest. The cells are loaded with an indicator dye 
that produces a fluorescent signal when bound to calcium, and the cells are contacted 
with a test substance and a receptor agonist, such as DATP, DAMP, or UTP. Any 
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change in fluorescent signal is measured over a defined period of time using, for 
example, a fluorescence spectrophotometer or a fluorescence imaging plate reader. A 
change in the fluorescence signal pattern generated by the ligand indicates that a 
compound is a potential antagonist or agonist for the receptor. 

[0265] In yet another screening procedure, mammalian cells are transfected to 

express the receptor of interest, and are also transfected with a reporter gene construct 
that is coupled to activation of the receptor (for example, but not limited to luciferase 
or beta-galactosidase behind an appropriate promoter). The cells are contacted with a 
test substance and the receptor agonist (ligand), such as dATP, dAMP, or UTP, and the 
signal produced by the reporter gene is measured after a defined period of time. The 
signal can be measured using a luminometer, spectrophotometer, fluorimeter, or other 
such instrument appropriate for the specific reporter construct used. Inhibition of the 
signal generated by the ligand indicates that a compound is a potential antagonist for 
the receptor. 

[0266] Another screening technique for antagonists or agonists involves 

introducing RNA encoding the GPCR polypeptide into cells (or CHO, HEK 293, 
RBL-2H3, etc.) to transiently or stably express the receptor. The receptor cells are 
then contacted with the receptor ligand, such as dATP, dAMP, or UTP, and a 
compound to be screened. Inhibition or activation of the receptor is then determined 
by detection of a signal, such as, cAMP, calcium, proton, or other ions. 

EXAMPLE 7 

FUNCTIONAL CHARACTERIZATION OF HGPRBMY4 
DNA Constructs: 

[0267] The putative GPCR HGPRBMY4 cDNA was PCR amplified using 

PFU™ (Stratagene). The primers used in the PCR reaction were specific to the 
HGPRBMY4 polynucleotide and were ordered from Gibco BRL (5 prime primer: 5'- 
cccaagcttgcaccatgatggtggatcccaatggcattg-3' (SEQ ID NO:30) 3 prime primer: 5'- 
gaagatctctagggctctgaagcgtgtgtggcc-3' (SEQ ID NO:31). The following 3 prime 
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primer was used to add a Flag-tag epitope to the HGPRBMY4 polypeptide for 
immunocytochemistry: 5'- 

gaagatctctacttgtcgtcgtcgtccttgtagtccatgggctctgaagcgtgtgtggc -3'(SEQ ID NO:32). The 
product from the PCR reaction was isolated from a 0.8% Agarose gel (Invitrogen) and 
purified using a Gel Extraction Kit ™ from Qiagen. 

[0268] The purified product was then digested overnight with the pcDNA3 . 1 

Hygro ™ mammalian expression vector from Invitrogen using the Hindlll and BamHI 
restriction enzymes (New England Biolabs). These digested products were then 
purified using the Gel Extraction Eat ™ from Qiagen and subsequently ligated to the 
pcDNA3.1 Hygro ™ expression vector using a DNA molar ratio of 4 parts insert: 1 
vector. All DNA modification enzymes were purchased from NEB. The ligation was 
incubated overnight at 16°C, after which time, one microliter of the mix was used to 
transform DH5 alpha cloning efficiency competent E. coli ™ (Gibco BRL). A detailed 
description of the pcDNA3.1 Hygro ™ mammalian expression vector is available at 
the Invitrogen web site ( www.Invitrogen.com) . The plasmid DNA from the ampicillin 
resistant clones were isolated using the Wizard DNA Miniprep System ™ from 
Promega. Positive clones were then confirmed and scaled up for purification using the 
Qiagen Maxiprep ™ plasmid DNA purification kit. 

Cell Line Generation: 

[0269] The pcDNA3 . 1 hygro vector containing the orphan HGPRBMY4 cDNA 

was used to transfect CHO/NFAT-CRE or the CHO/NFAT G alpha 15 (Aurora 
Biosciences) cells using Lipofectamine 2000 ™ according to the manufacturers 
specifications (Gibco BRL). Two days later, the cells were split 1:3 into selective 
media (DMEM 11056, 600 u.g/ml Hygromycin, 200 (ag/ml Zeocin, 10% FBS). All cell 
culture reagents were purchased from Gibco BRL-Invitrogen. 

[0270] The CHO-NFAT/CRE or CHO-NFAT G alpha 1 5cell lines, transiently 

or stably transfected with the orphan HGPRBMY4 GPCR, were analyzed using the 
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FACS Vantage SE ™ (BD), fluorescence microscopy (Nikon), and the LJL Analyst ™ 
(Molecular Devices). In this system, changes in real-time gene expression, as a 
consequence of constitutive G-protein coupling of the orphan HGPRBMY4 GPCR, 
were examined by analyzing the fluorescence emission of the transformed cells at 
447nm and 518nm. The changes in gene expression were visualized using Beta- 
Lactamase as a reporter, and, when induced by the appropriate signaling cascade, 
hydrolyzed an intracellularly loaded, membrane-permeant ester substrate 
Cephalosporin-Coumarin-Fluorescein2/ Acetoxymethyl (CCF2/AM™ Aurora 
Biosciences; Zlokarnik, et al., 1998). The CCF2/AM™ substrate is a 7- 
hydroxycoumarin cephalosporin with a fluorescein attached through a stable thioether 
linkage. Induced expression of the Beta-Lactamase enzyme was readily apparent since 
each enzyme molecule produced was capable of changing the fluorescence of many 
CCF2/AM ™ substrate molecules. A schematic of this cell based system is shown 
below. 



trapped, fluorescent substrate with a cephalosporin core that links a 7- 
hydroxycoumarin to a fluorescein. For the intact molecule, excitation of the coumarin 
at 409 nm results in Fluorescence Resonance Energy Transfer (FRET) to the 
fluorescein which emits green light at 518 nm. Production of active Beta-Lactamase 
results in cleavage of the Beta-Lactam ring, leading to disruption of FRET, and 
excitation of the coumarin only - thus giving rise to blue fluorescent emission at 447 
nm. 




[0271] 



In summary, CCF2/AM ™ is a membrane permeant, intracellularly- 
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[0272] Fluorescent emissions were detected using a Nikon-TE300 microscope 

equipped with an excitation filter (D405/10X-25), dichroic reflector (430DCLP), and a 
barrier filter for dual DAPI/FITC (510nM) to visually capture changes in Beta- 
Lactamase expression. The FACS Vantage SE was equipped with a Coherent 
Enterprise II Argon Laser and a Coherent 302C Krypton laser. In flow cytometry, UV 
excitation at 351-364 nm from the Argon Laser or violet excitation at 407 nm from the 
Krypton laser were used. The optical filters on the FACS Vantage SE are HQ460/50m 
and HQ535/40m bandpass were separated by a 490 dichroic mirror. 
[0273] Prior to analyzing the fluorescent emissions from the cell lines as 

described above, the cells were loaded with the CCF2/AM substrate. A 6X CCF2/AM 
loading buffer was prepared whereby ImM CCF2/AM (Aurora Biosciences) was 
dissolved in 100% DMSO (Sigma). Stock solution (12 ul) was added to 60 ul of 
lOOmg/ml Pluronic F127 (Sigma) in DMSO containing 0.1% Acetic Acid (Sigma). 
This solution was added while vortexing to 1 mL of Sort Buffer (PBS minus calcium 
and magnesium-Gibco-25 mM HEPES-Gibco- pH 7.4, 0.1% BSA). Cells were placed 
in serum-free media and the 6X CCF2/AM was added to a final concentration of IX. 
The cells were then loaded at room temperature for one to two hours, and then 
subjected to fluorescent emission analysis as described herein. Additional details 
relative to the cell loading methods and/or instrument settings may be found by 
reference to the following publications: see Zlokarnik, et al., 1998; Whitney et al., 
1998; and BD Biosciences, 1999. 

Immunocvtochemistrv: 

[0274] The cell lines transfected and selected for expression of Flag-epitope 

tagged orphan GPCRs were analyzed by immunocytochemistry. The cells were plated 
at 1X10 3 in each well of a glass slide (VWR). The cells were rinsed with PBS 
followed by acid fixation for 30 minutes at room temperature using a mixture of 5% 
Glacial Acetic Acid / 90% ethanol. The cells were then blocked in 2% BSA and 
0.1%Triton in PBS, incubated for 2 h at room temperature or overnight at 4°C. A 
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monoclonal FITC antibody directed against FLAG was diluted at 1:50 in blocking 
solution and incubated with the cells for 2 h at room temperature. Cells were then 
washed three times with 0.1%Triton in PBS for five minutes. The slides were 
overlayed with mounting media dropwise with Biomedia -Gel Mount™ (Biomedia; 
Containing Anti-Quenching Agent). Cells were examined at lOx magnification using 
the Nikon TE300 equiped with FITC filter (535nm). 

[0275] There is strong evidence that certain GPCRs exhibit a cDNA 

concentration-dependent constitutive activity through cAMP response element (CRE) 
luciferase reporters (Chen et al., 1999). In an effort to demonstrate functional coupling 
of HGPRBMY4 to known GPCR second messenger pathways, the HGPRBMY4 
polypeptide was expressed at high constitutive levels in the CHO-NFAT/CRE cell 
line. To this end, the HGPRBMY4 cDNA was PCR amplified and subcloned into the 
pcDNA3.1 hygro ™ mammalian expression vector as described herein. Early passage 
CHO-NFAT/CRE cells were then transfected with the resulting pcDNA3.1 hygro ™ / 
HGPRBMY4 construct. Transfected and non-transfected CHO-NFAT/CRE cells 
(control) were loaded with the CCF2 substrate and stimulated with 10 nM PMA, and 1 
uM Thapsigargin (NFAT stimulator) or 10 uM Forskolin (CRE stimulator) to fully 
activate the NFAT/CRE element. The cells were then analyzed for fluorescent 
emission by Fluorescent Assisted Cell Sorter, FACS. 
[0276] The FACS profile demonstrated the constitutive activity of 

HGPRBMY4 in the CHO-NFAT/CRE line as evidenced by the significant population 
of cells with blue fluorescent emission at 447 ran (see Figure 10: Blue Cells). The 
cells were analyzed via FACS according to their wavelength emission at 518 nM 
(Channel R3 - Green Cells), and 447 nM (Channel R2 - Blue Cells). As shown, 
overexpression of HGPRBMY4 resulted in functional coupling and subsequent 
activation of beta lactamase gene expression, as evidenced by the significant number 
of cells with fluorescent emission at 447 nM relative to the non-transfected control 
CHO-NFAT/CRE cells (shown in Figure 9). 

[0277] As expected, the NFAT/CRE response element in the untransfected 
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control cell line was not activated (i.e., beta lactamase not induced), enabling the 
CCF2 substrate to remain intact, and resulting in the green fluorescent emission at 518 
nM (see Figure 9-Green Cells). The cells were analyzed via FACS according to their 
wavelength emission at 518 nM (Channel R3 - Green Cells), and 447 nM (Channel R2 
- Blue Cells). As shown, the vast majority of cells emitted at 518 nM, with minimal 
emission observed at 447 nM. The latter was expected since the NFAT/CRE response 
elements remained dormant in the absence of an activated G-protein dependent signal 
transduction pathway (e.g., pathways mediated by Gq/1 1 or Gs coupled receptors). As 
a result, the cell permeant, CCF2/AM™ (Aurora Biosciences; Zlokarnik, et al., 1998) 
substrate remained intact and emitted light at 5 18 nM. 

[0278] A very low level of leaky Beta Lactamase expression was detectable as 

evidenced by the small population of cells emitting at 447 an. Analysis of a stable 
pool of cells transfected with HGPRBMY4 revealed constitutive coupling of the cell 
population to the NFAT/CRE response element, activation of Beta Lactamase and 
cleavage of the substrate (Figure 10-Blue Cells). These results demonstrated that 
overexpression of HGPRBMY4 leads to constitutive coupling of signaling pathways 
known to be mediated by Gq/1 1 or Gs coupled receptors that converge to activate 
either the NFAT or CRE response elements respectively (Boss et al., 1996; Chen et al., 
1999). 

[0279] In an effort to further characterize the observed functional coupling of 

the HGPRBMY4 polypeptide, its ability to couple to a G protein was examined. To 
this end, the promiscuous G protein, G alpha 15 was utilized. Specific domains of 
alpha subunits of G proteins have been shown to control coupling to GPCRs (Blahos et 
al., 2001). It has been shown that the extreme C-terminal 20 amino acids of either G 
alpha 15 or 16 confer the unique ability of these G proteins to couple to many GPCRs, 
including those that naturally do not stimulate PLC (Blahos et al., 2001). Indeed, both 
G alpha 15 and 16 have been shown to couple a wide variety of GPCRs to 
Phospholipase C activation of calcium mediated signaling pathways (including the 
NFAT-signaling pathway) (Offermanns & Simon). To demonstrate that HGPRBMY4 

EXPRESS MAIL LABEL NO.: EL 912004928 US 

646785 vl 



-92- 



was functioning as a GPCR, the CHO-NFAT G alpha 15 cell line that contained only 
the integrated NFAT response element linked to the Beta-Lactamase reporter was 
transfected with the pcDNA3.1 hygro ™ / HGPRBMY4 construct. Analysis of the 
fluorescence emission from this stable pool showed that HGPRBMY4 constitutively 
coupled to the NFAT mediated second messenger pathways via G alpha 15 (see Figure 
1 1 and 12). 

[0280] In conclusion, the results were consistent with HGPRBMY4 

representing a functional GPCR analogous to known G alpha 15 coupled receptors. 
Therefore, constitutive expression of HGPRBMY4 in the CHO-NFAT G alpha 15 cell 
line lead to NFAT activation through accumulation of intracellular Ca 2+ as has been 
demonstrated for the M3 muscarinic receptor (Boss et al., 1996). 

Demonstration of Cellular Expression: 

[0281] HGPRBMY4 was tagged at the C-terminus using the Flag epitope and 

inserted into the pcDNA3.1 hygro ™ expression vector, as described herein. 
Immunocytochemistry of CHO-NFAT G alpha 15 cell lines transfected with the Flag- 
tagged HGPRBMY4 construct with FITC conjugated monoclonal antibody raised 
against FLAG demonstrated that HGPRBMY4 was indeed a cell surface receptor. The 
immunocytochemistry also confirmed expression of the HGPRBMY4 in the CHO- 
NFAT G alpha 15 cell lines. Briefly, CHO-NFAT G alpha 15 cell lines were 
transfected with pcDNA3.1 hygro ™ / HGPRBMY4-Flag vector, fixed with 70% 
methanol, and permeablized with 0.1%TritonX100. The cells were then blocked with 
1% Serum and incubated with a FITC conjugated Anti Flag monoclonal antibody at 
1:50 dilution in PBS -Triton. The cells were then washed several times with PBS- 
Triton, overlayed with mounting solution, and fluorescent images were captured (see 
Figure 13). Figure 13 shows the untransfected CHO-NFAT G alpha 15 cell line FACS 
profile. CHO-NFAT/CRE cell lines transfected with the pcDNA3.1 Hygro ™ / 
HGPRBMY4-FLAG mammalian expression vector were subjected to 
immunocytochemistry using an FITC conjugated monoclonal antibody raised against 
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FLAG, as described herein. Panel A shows the transfected CHO-NFAT/CRE cells 
under visual wavelengths, and panel B shows the fluorescent emission of the same 
cells at 530 nm after illumination with a mercury light source. The cellular 
localization is clearly evident in panel B, and is consistent with the HGPRBMY4 
polypeptide representing a member of the GPCR family. 

[0282] The control cell line, non-transfected CHO-NF AT G alpha 1 5 cell line, 

exhibited no detectable background fluorescence (Figure 13). The BMY4 -FLAG 
tagged expressing CHO-NF AT G alpha 15 line exhibited specific plasma membrane 
expression as indicated (Figure 13). These data provided clear evidence that BMY4 
was expressed in these cells and the majority of the protein was localized to the cell 
surface. Cell surface localization was consistent with HGPRBM4 representing a 7 
transmembrane domain containing GPCR. Taken together, the data indicated that 
HGPRBMY4 was a cell surface GPCR that functioned through increases in Ca 2+ 
signal transduction pathways via G alpha 15. 

Screening Paradigm: 

[0283] The Aurora Beta-Lactamase technology provided a clear path for 

identifying agonists and antagonists of the HGPRBMY4 polypeptide. Cell lines that 
exhibited a range of constitutive coupling activity were identified by sorting through 
HGPRBMY4 transfected cell lines using the FACS Vantage SE (see Figure 14). 
Figure 14 describes several CHO-NFAT/CRE cell lines transfected with the 
pcDNA3.1 Hygro ™ / HGPRBMY4 mammalian expression vector isolated via FACS 
that had either intermediate or high beta lactamase expression levels of constitutive 
activation. 

[0284] For example, cell lines were sorted that had an intermediate level of 

orphan GPCR expression, which also correlated with an intermediate coupling 
response, using the LJL analyst. Such cell lines provided the opportunity to screen, 
indirectly, for both agonists and antogonists of HGPRBMY4 by identifying inhibitors 
that blocked the beta lactamase response, or agonists that increased the beta lactamase 
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response. As described herein, modulating the expression level of beta lactamase 
directly correlated with the level of cleaved CCR2 substrate. For example, this 
screening paradigm was shown to work for the identification of modulators of a known 
GPCR, 5HT6, that couples through Adenylate Cyclase, in addition to, the 
identification of modulators of the 5HT2c GPCR, that couples through changes in [Ca 
2+ ]i. The data shown below represented cell lines that were engineered with the 
desired pattern of HGPRBMY4 expression to enable the identification of potent small 
molecule agonists and antagonists. HGPRBMY4 modulator screens may be carried 
out using a variety of high throughput methods known in the art, though preferably 
using the fully automated Aurora UHTSS system. The uninduced, orphan- transfected 
CHO-NFAT/CRE cell line represented the relative background level of beta lactamase 
expression (Figure 14; panel a). Following treatment with a cocktail of 10 nM PMA, 
luM Thapsigargin, and lOuM Forskolin (Figure 14; P/T/F; panel b), the cells fully 
activated the CRE-NFAT response element demonstrating the dynamic range of the 
assay. Panel C (Figure 14) represents an orphan transfected CHO-NFAT/CRE cell 
line that showed an intermediate level of beta lactamase expression post P/T/F 
stimulation, while panel D (Figure 14) represents a HGPRBMY4 transfected CHO- 
NFAT/CRE cell line that showd a high level of beta lactamase expression post P/T/F 
stimulation. 

[0285] Figure 14 shows that representative transfected CHO-NFAT/CRE cell 

lines with intermediate and high beta lactamase expression levels were useful in 
identifing HGPRBMY4 agonists and/or antagonists. Several CHO-NFAT/CRE cell 
lines transfected with the pcDNA3.1 Hygro ™ / HGPRBMY4 mammalian expression 
vector were isolated via FACS that had either intermediate or high beta lactamase 
expression levels of constitutive activation, as described herein. Panel A (Figure 14) 
shows untransfected CHO-NFAT/CRE cells prior to stimulation with 10 nM PMA, 1 
uM Thapsigargin, and 10 uM Forskolin ( - P/T/F). Panel B (Figure 14) shows CHO- 
NFAT/CRE cells after stimulation with 10 nM PMA,1 uM Thapsigargin, and 10 uM 
Forskolin ( + P/T/F). Panel C (Figure 14) shows a representative orphan GPCR 
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(oGPCR) transfected CHO-NFAT/CRE cells that have an intermediate level of beta 
lactamase expression. Panel D (Figure 14) shows a representative orphan GPCR 
transfected CHO-NFAT/CRE that have a high level of beta lactamase expression. 

EXAMPLE 8 

PHAGE DISPLAY METHODS FOR IDENTIFYING PEPTIDE LIGANDS OR 
MODULATORS OF ORPHAN GPCRs 

Library Construction 

[0286] Two HGPRBMY libraries were used for identifying peptides that may 

function as modulators. Specifically, a 15-mer library was used to identify peptides 
that may function as agonists or antagonists. The 15-mer library was an aliquot of the 
15-mer library originally constructed by G.P. Smith (Scott, JK and Smith, GP. 1990, 
Science 249:386-390). A 40-mer library was used for identifying natural ligands and 
constructed essentially as previously described (BK Kay, et al. 1993, Gene 128:59- 
65), with the exception that a 15 base pair complementary region was used to anneal 
the two oligonucleotides, as opposed to 6, 9, or 12 base pairs, as described below. 
[0287] The oligos used were: Oligo 1 : 5'- 

CGAAGCGTAAGGGCCCAGCCGGCC (NNK x 20) CCGGGTCCGGGCGGC-3' 
(SEQ ID NO:46) and 01igo2: 5'-AAAAGGAAAAAAGCGGCCGC (VNN x 20) 
GCCGCCCGGACCCGG-3' (SEQ ID NO:47), where N= A+G+C+T and K = C+G+T 
and V=C+A+G. 

[0288] The oligos were annealed through their 15 base pair complimentary 

sequences which encode a constant ProGlyProGlyGly (SEQ ID NO:48) pentapeptide 
sequence between the random 20 amino acid segments, and then extended by standard 
procedure using Klenow enzyme. This was followed by endonuclease digestion using 
Sfil and Notl enzymes and ligation to Sfil and Notl cleaved pCantab5E (Pharmacia). 
The ligation mixture was electroporated into E. coli XL 1 Blue and phage clones were 
essentially generated as suggested by the manufacturer for making ScFv antibody 
libraries in pCantab5E. 
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Sequencing Bound Phage 

[0289] Standard procedures commonly known in the art were used. Phage in 

eluates were infected into E. coli host strain (TGI for the 15-mer library; XLlBlue for 
the 40-mer library) and plated for single colonies. Colonies were grown in liquid and 
sequenced by standard procedure which involved: 1) generating PCR product with 
suitable primers of the library segments in the phage genome (15 mer library) or 
pCantab5E (40 mer library); and 2) sequencing PCR products using one primer of each 
PCR primer pair. Sequences were visually inspected or by using the Vector NTI 
alignment tool. 



Peptide Modulators 

[0290] The following serve as non-limiting examples of peptides: 

GDF WYE ACES S C AF W (SEQ ID NO: 53) 

CLRSGTGCAFQLYRF (SEQ ID NO:54) 

FAGQHWYDALDTLM (SEQ ID NO:5 5) 

LIFFDARDCCFNEQL (SEQ ID NO:56) 

LEWGSDVFYDVYDCC (SEQ ID NO:57) 

RIVPNGYFNVHGRSL (SEQ ID NO:5 8) 

WERS S AGC ADQQYRC (SEQ ID NO:59) 

YFSDGESFFEPGDCC (SEQ ID NO:60) 

Peptide Synthesis 

[0291] Peptides were synthesized on Fmoc-Knorr amide resin [N-(9- 

fluorenyl)methoxycarbonyl-Rnorr amide-resin; Midwest Biotech; Fishers, IN] with an 
Applied Biosystems (Foster City, CA) model 43 3 A synthesizer and the FastMoc 
chemistry protocol (0.25mmol scale) supplied with the instrument. Amino acids were 
double coupled as their N-a-Fmoc- derivatives and reactive side chains were protected 
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as follows: Asp, Glu: t-Butyl ester (OtBu); Ser, Thr, Tyr: t-Butyl ether (tBu); Asn, 
Cys, Gin, His: Triphenylmethyl (Trt); Lys, Trp: t-Butyloxycarbonyl (Boc); Arg: 
2,2,4,6,7-Pentamethyldihydrobenzofuran-5-sulfonyl (Pbf). After the final double 
coupling cycle, the N-terminal Fmoc group was removed by the multi-step treatment 
with piperidine in N-Methylpyrrolidone described by the manufacturer. The N- 
terminal free amines were then treated with 10% acetic anhydride, 5% 
Diisopropylamine in N-Methylpyrrolidone to yield the N-acetyl-derivative. The 
protected peptidyl-resins were simultaneously deprotected and removed from the resin 
by standard methods. The lyophilized peptides were purified on Cig to apparent 
homogeneity as judged by RP-HPLC analysis. Predicted peptide molecular weights 
were verified by electrospray mass spectrometry ( J. Biol. Chem . 273:12041-12046, 
1998). 

[0292] Cyclic analogs were prepared from the crude linear products. The 

cysteine disulfide was formed using one of the following methods: 

Method 1 : 

[0293] A sample of the crude peptide was dissolved in water at a concentration 

of 0.5 mg/mL and the pH adjusted to 8.5 with NH 4 OH. The reaction was stirred at 
room temperature, and monitored by RP-HPLC. Once completed, the reaction was 
adjusted to pH 4 with acetic acid and lyophilized. The product was purified and 
characterized as above. 

Method 2: 

[0294] A sample of the crude peptide was dissolved at a concentration of 

0.5mg/mL in 5% acetic acid. The pH was adjusted to 6.0 with NH 4 OH. DMSO (20% 
by volume) was added and the reaction was stirred overnight. After analytical RP- 
HPLC analysis, the reaction was diluted with water and triple lyophilized to remove 
DMSO. The crude product was purified by preparative RP-HPLC a ACS . 1 1 3 :6657, 
1991) 
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Assessing Affect of Peptides on GPCR Function. 

[0295] The effect of any one of these peptides on the function of the GPCR of 

the present invention was determined by adding an effective amount of each peptide to 
each functional assay. Representative functional assays are described more 
specifically herein, particularly Example 7. 

Uses Of The Peptide Modulators Of The Present Invention. 

[0296] The aforementioned peptides of the present invention may be useful for 

a variety of purposes, though most notably for modulating the function of the GPCR of 
the present invention, and potentially with other GPCRs of the same G-protein coupled 
receptor subclass (e.g., peptide receptors, adrenergic receptors, purinergic receptors, 
etc.), and/or other subclasses known in the art. For example, the peptide modulators of 
the present invention may be useful as HGPRBMY4 agonists. Alternatively, the 
peptide modulators of the present invention may be useful as HGPRBMY4 antagonists 
of the present invention. In addition, the peptide modulators of the present invention 
may be useful as competitive inhibitors of the HGPRBMY4 cognate ligand(s), or may 
be useful as non-competitive inhibitors of the HGPRBMY4 cognate ligand(s). 
[0297] Furthermore, the peptide modulators of the present invention may be 

useful in assays designed to either deorphan the HGPRBMY4 polypeptide of the 
present invention, or to identify other agonists or antagonists of the HGPRBMY4 
polypeptide of the present invention, particularly small molecule modulators. 
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EXAMPLE 9 

METHOD OF CREATING N- AND C -TERMINAL DELETION MUTANTS 
CORRESPONDING TO THE HGPRBMY4 POLYPEPTIDE 

[0298] As described elsewhere herein, the present invention encompasses the 

creation of N- and C-terminal deletion mutants, in addition to any combination of N- 
and C-terminal deletions thereof, corresponding to the HGPRBMY4 polypeptide of the 
present invention. A number of methods are available to one skilled in the art for 
creating such mutants. Such methods may include a combination of PCR 
amplification and gene cloning methodology. Although one of skill in the art of 
molecular biology, through the use of the teachings provided or referenced herein, 
and/or otherwise known in the art as standard methods, could readily create each 
deletion mutants of the present invention, exemplary methods are described below. 
[0299] Briefly, using the isolated cDNA clone encoding the full-length 

HGPRBMY4 polypeptide sequence, appropriate primers of about 15-25 nucleotides 
derived from the desired 5' and 3' positions of SEQ ID NO:l may be designed to PCR 
amplify, and subsequently clone, the intended N- and/or C-terminal deletion mutant. 
Such primers could comprise, for example, an initiation and stop codon for the 5 ' and 
3' primer, respectively. Such primers may also comprise restriction sites to facilitate 
cloning of the deletion mutant post amplification. Moreover, the primers may 
comprise additional sequences, such as, for example, flag-tag sequences, kozac 
sequences, or other sequences discussed and/or referenced herein. 
[0300] For example, in the case of the Q27 to P3 1 8 N-terminal deletion 

mutant, the following primers could be used to amplify a cDNA fragment 
corresponding to this deletion mutant: 

5 ' 5 '-GCAGCA GCGGCCGC CAGTTCTGGTTGGCCTTCCCATTG -3 ' (SEQ ID NO:49) 

Primer NotI 

3' 5'- GCAGCA GTCGAC GGGCTCTGAAGCGTGTGTGGCCAC -3' (SEQ ID NO:50) 

Primer Sail 

[0301] For example, in the case of the Ml to K297 C-terminal deletion mutant, 
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the following primers could be used to amplify a cDNA fragment corresponding to this 
deletion mutant: 

5' 5'- GCAGCA GCGGCCGC ATGATGGTGGATCCCAATGGCAATG -3' (SEQ ED NO:5 1) 

Primer NotI 

3' 5'- GCAGCA GTCGAC CTTCACTCCATAGACAATTGGGTTG -3' (SEQ ID NO:52) 

Primer Sail 

[0302] Representative PCR amplification conditions are provided below, 

although the skilled artisan would appreciate that other conditions may be required for 
efficient amplification. A 100 ul PCR reaction mixture may be prepared using lOng of 
the template DNA (cDNA clone of HGPRBMY4), 200 uM 4dNTPs, luM primers, 
0.25U Taq DNA polymerase (PE), and standard Taq DNA polymerase buffer. Typical 
PCR cycling condition are as follows: 

20-25 cycles: 45 sec, 93 degrees 

2 min, 50 degrees 

2 min, 72 degrees 
1 cycle: 10 min, 72 degrees 

[0303] After the final extension step of PCR, 5U Klenow Fragment may be 

added and incubated for 15 min at 30 degrees. 

[0304] Upon digestion of the fragment with the NotI and Sail restriction 

enzymes, the fragment could be cloned into an appropriate expression and/or cloning 
vector which has been similarly digested (e.g., pSportl, among others). The skilled 
artisan would appreciate that other plasmids could be equally substituted, and may be 
desirable in certain circumstances. The digested fragment and vector are then ligated 
using a DNA ligase, and then used to transform competent E.coli cells using methods 
provided herein and/or otherwise known in the art. 

[0305] The 5' primer sequence for amplifying any additional N-terminal 

deletion mutants may be determined by reference to the following formula: 

(S+(X * 3)) to ((S+(X * 3))+25), 
wherein 'S' is equal to the nucleotide position of the initiating start codon of the 
HGPRBMY4 gene (SEQ ID NO:l), and 'X' is equal to the most N-terminal amino 
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acid of the intended N-terminal deletion mutant. The first term will provide the start 
5' nucleotide position of the 5' primer, while the second term will provide the end 3' 
nucleotide position of the 5' primer corresponding to sense strand of SEQ ID NO:l. 
Once the corresponding nucleotide positions of the primer are determined, the final 
nucleotide sequence may be created by the addition of applicable restriction site 
sequences to the 5' end of the sequence, for example. As referenced herein, the 
addition of other sequences to the 5' primer may be desired in certain circumstances 
(e.g., kozac sequences, etc.). 

[0306] The 3' primer sequence for amplifying any additional N-terminal 

deletion mutants may be determined by reference to the following formula: 

(S+(X * 3)) to ((S+(X * 3))-25), 
wherein 'S' is equal to the nucleotide position of the initiating start codon of the 
HGPRBMY4 gene (SEQ ID NO:l), and 'X' is equal to the most C-terminal amino 
acid of the intended N-terminal deletion mutant. The first term will provide the start 
5' nucleotide position of the 3' primer, while the second term will provide the end 3' 
nucleotide position of the 3' primer corresponding to the anti-sense strand of SEQ ID 
NO:l. Once the corresponding nucleotide positions of the primer are determined, the 
final nucleotide sequence may be created by the addition of applicable restriction site 
sequences to the 5' end of the sequence, for example. As referenced herein, the 
addition of other sequences to the 3' primer may be desired in certain circumstances 
(e.g., stop codon sequences, etc.). The skilled artisan would appreciate that 
modifications of the above nucleotide positions may be necessary for optimizing PCR 
amplification. 

[0307] The same general formulas provided above may be used in identifying 

the 5' and 3' primer sequences for amplifying any C-terminal deletion mutant of the 
present invention. Moreover, the same general formulas provided above may be used 
in identifying the 5' and 3' primer sequences for amplifying any combination of N- 
terminal and C-terminal deletion mutant of the present invention. The skilled artisan 
would appreciate that modifications of the above nucleotide positions may be 
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necessary for optimizing PCR amplification. 

[0308] In preferred embodiments, the following N-terminal HGPRBMY4 

deletion polypeptides are encompassed by the present invention (of SEQ ID NO:2): 
M1-P318, M2-P318, V3-P318, D4-P318, P5-P318, N6-P318, G7-P318, N8-P318, E9- 
P318, S10-P318, S11-P318, A12-P318, T13-P318, Y14-P318, F15-P318, 116-P318, 
L17-P318, 118-P318, G19-P318, L20-P318, P21-P318, G22-P318, L23-P318, E24- 
P318, E25-P318, A26-P318, Q27-P318, F28-P318, W29-P318, L30-P318, A31-P318, 
F32-P318, P33-P318, L34-P318, C35-P318, S36-P318, L37-P318, Y38-P318, L39- 
P318, 140-P318, A41-P318, V42-P318, L43-P318, G44-P318, N45-P318, L46-P318, 
T47-P318, 148-P318, 149-P318, Y50-P318, 151-P318, V52-P318, R53-P318, T54- 
P318, E55-P318, H56-P318, S57-P318, L58-P318, H59-P318, E60-P318, P61-P318, 
M62-P318, Y63-P318, 164-P318, F65-P318, L66-P318, C67-P318, M68-P318, L69- 
P318, S70-P318, G71-P318, I72-P318, D73-P318, 174-P318, L75-P318, 176-P318, 
S77-P318, T78-P318, S79-P318, S80-P318, M81-P318, P82-P318, K83-P318, M84- 
P318, L85-P318, A86-P318, 187-P318, F88-P318, W89-P318, F90-P318, N91-P318, 
S92-P318, T93-P318, T94-P318, 195-P318, Q96-P318, F97-P318, D98-P318, A99- 
P318, C100-P318, L101-P318, L102-P318, Q103-P318, M104-P318, F105-P318, 
A106-P318, 1107-P318, H108-P318, S109-P318, L110-P318, S111-P318, G112-P318, 
M113-P318, E114-P318, S115-P318, T116-P318, V117-P318, L118-P318, LI 19- 
P318, A120-P318, M121-P318, A122-P318, F123-P318, D124-P318, R125-P318, 
Y126-P318, V127-P318, A128-P318, 1129-P318, C130-P318, H131-P318, P132-P318, 
L133-P318, R134-P318, H135-P318, A136-P318, T137-P318, V138-P318, L139- 
P318, T140-P318, L141-P318, P142-P318, R143-P318, V144-P318, T145-P318, 
K146-P318, 1147-P318, G148-P318, V149-P318, A150-P318, A151-P318, V152- 
P318, V153-P318, R154-P318, G155-P318, A156-P318, A157-P318, L158-P318, 
M159-P318, A160-P318, P161-P318, L162-P318, P163-P318, V164-P318, F165- 
P318, I166-P318, K167-P318, Q168-P318, L169-P318, P170-P318, F171-P318, C172- 
P318, R173-P318, S174-P318, N175-P318, 1176-P318, L177-P318, S178-P318, H179- 
P318, S180-P318, Y181-P318, C182-P318, L183-P318, H184-P318, Q185-P318, 
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D186-P318, V187-P318, M188-P318, K189-P318, L190-P318, A191-P318, C192- 
P318, D193-P318, D194-P318, 1195-P318, R196-P318, V197-P318, N198-P318, 
V199-P318, V200-P318, Y201-P318, G202-P318, L203-P318, 1204-P318, V205- 
P318, 1206-P318, 1207-P318, S208-P318, A209-P318, 1210-P318, G211-P318, L212- 
P318, D213-P318, S214-P318, L215-P318, L216-P318, 1217-P318, S218-P318, F219- 
P318, S220-P318, Y221-P318, L222-P318, L223-P318, I224-P318, L225-P318, K226- 
P318, T227-P318, V228-P318, L229-P318, G230-P318, L231-P318, T232-P318, 
R233-P318, E234-P318, A235-P318, Q236-P318, A237-P318, K238-P318, A239- 
P318, F240-P318, G241-P318, T242-P318, C243-P318, V244-P318, S245-P318, 
H246-P318, V247-P318, C248-P318, A249-P318, V250-P318, F251-P318, 1252-P318, 
F253-P318, Y254-P318, V255-P318, P256-P318, F257-P318, 1258-P318, G259-P318, 
L260-P318, S261-P318, M262-P318, V263-P318, H264-P318, R265-P318, F266- 
P318, S267-P318, K268-P318, R269-P318, R270-P318, D271-P318, S272-P318, 
P273-P318, L274-P318, P275-P318, V276-P318, 1277-P318, L278-P318, A279-P318, 
N280-P318, 1281-P318, Y282-P318, L283-P318, L284-P318, V285-P318, P286-P318, 
P287-P318, V288-P318, L289-P318, N290-P318, P291-P318, 1292-P318, V293-P318, 
Y294-P318, G295-P318, V296-P318, K297-P318, T298-P318, K299-P318, E300- 
P318, 1301-P318, R302-P318, Q303-P318, R304-P318, 1305-P318, L306-P318, R307- 
P318, L308-P318, F309-P318, H310-P318, V311-P318, and/or A312-P318 of SEQ ID 
NO:2. Polynucleotide sequences encoding these polypeptides axe also included in 
SEQ ID NO:l. The present invention also encompasses the use of these N-terminal 
HGPRBMY4 deletion polypeptides as immunogenic and/or antigenic epitopes as 
described elsewhere herein. 

[0309] In preferred embodiments, the following C-terminal HGPRBMY4 

deletion polypeptides are encompassed by the present invention (of SEQ ID NO:2): 
M1-P318, M1-E317, M1-S316, M1-A315, M1-H314, M1-T313, M1-A312, M1-V311, 
M1-H310, M1-F309, M1-L308, M1-R307, M1-L306, M1-I305, M1-R304, M1-Q303, 
M1-R302, M1-I301, M1-E300, M1-K299, M1-T298, M1-K297, M1-V296, M1-G295, 
M1-Y294, M1-V293, M1-I292, M1-P291, M1-N290, M1-L289, M1-V288, M1-P287, 
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M1-P286, M1-V285, M1-L284, M1-L283, M1-Y282, M1-I281, M1-N280, M1-A279, 
M1-L278, M1-I277, M1-V276, M1-P275, M1-L274, M1-P273, M1-S272, M1-D271, 
M1-R270, M1-R269, M1-K268, M1-S267, M1-F266, M1-R265, M1-H264, M1-V263, 
M1-M262, M1-S261, M1-L260, M1-G259, M1-I258, M1-F257, M1-P256, M1-V255, 
M1-Y254, M1-F253, M1-I252, M1-F251, M1-V250, M1-A249, M1-C248, M1-V247, 
M1-H246, M1-S245, M1-V244, M1-C243, M1-T242, M1-G241, M1-F240, M1-A239, 
M1-K238, M1-A237, M1-Q236, M1-A235, M1-E234, M1-R233, M1-T232, M1-L231, 
M1-G230, M1-L229, M1-V228, M1-T227, M1-K226, M1-L225, M1-I224, M1-L223, 
M1-L222, M1-Y221, M1-S220, M1-F219, M1-S218, M1-I217, M1-L216, M1-L215, 
M1-S214, M1-D213, M1-L212, M1-G211, M1-I210, M1-A209, M1-S208, Ml -1207, 
M1-I206, M1-V205, Ml -1204, M1-L203, M1-G202, M1-Y201, M1-V200, M1-V199, 
M1-N198, M1-V197, M1-R196, M1-I195, M1-D194, M1-D193, M1-C192, M1-A191, 
M1-L190, M1-K189, M1-M188, M1-V187, M1-D186, M1-Q185, Ml -HI 84, Ml- 
L183, M1-C182, M1-Y181, M1-S180, M1-H179, M1-S178, M1-L177, M1-I176, Ml- 
N175, M1-S174, M1-R173, M1-C172, M1-F171, M1-P170, Ml -LI 69, M1-Q168, Ml- 
K167, M1-I166, M1-F165, M1-V164, M1-P163, M1-L162, M1-P161, M1-A160, Ml- 
M159, M1-L158, M1-A157, M1-A156, M1-G155, M1-R154, M1-V153, M1-V152, 
M1-A151, M1-A150, M1-V149, M1-G148, M1-I147, M1-K146, M1-T145, M1-V144, 
M1-R143, M1-P142, M1-L141, M1-T140, M1-L139, M1-V138, M1-T137, M1-A136, 
M1-H135, M1-R134, M1-L133, M1-P132, M1-H131, M1-C130, M1-I129, M1-A128, 
M1-V127, M1-Y126, M1-R125, M1-D124, M1-F123, M1-A122, M1-M121, Ml- 
A120, M1-L119, M1-L118, M1-V117, M1-T116, M1-S115, Ml -El 14, M1-M113, 
M1-G112, Ml-Slll, M1-L110, M1-S109, M1-H108, M1-I107, M1-A106, M1-F105, 
M1-M104, M1-Q103, M1-L102, M1-L101, M1-C100, M1-A99, M1-D98, M1-F97, 
M1-Q96, M1-I95, M1-T94, M1-T93, M1-S92, M1-N91, M1-F90, M1-W89, M1-F88, 
M1-I87, M1-A86, M1-L85, M1-M84, M1-K83, M1-P82, M1-M81, M1-S80, M1-S79, 
M1-T78, M1-S77, M1-I76, M1-L75, Ml -174, M1-D73, M1-I72, M1-G71, M1-S70, 
M1-L69, M1-M68, M1-C67, M1-L66, M1-F65, Ml -164, M1-Y63, M1-M62, M1-P61, 
M1-E60, M1-H59, M1-L58, M1-S57, M1-H56, M1-E55, M1-T54, M1-R53, M1-V52, 
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M1-I51, M1-Y50, M1-I49, M1-I48, M1-T47, M1-L46, M1-N45, M1-G44, M1-L43, 
M1-V42, M1-A41, M1-I40, M1-L39, M1-Y38, M1-L37, M1-S36, M1-C35, M1-L34, 
M1-P33, M1-F32, M1-A31, M1-L30, M1-W29, M1-F28, M1-Q27, M1-A26, M1-E25, 
M1-E24, M1-L23, M1-G22, M1-P21, M1-L20, M1-G19, M1-I18, M1-L17, M1-I16, 
M1-F15, M1-Y14, M1-T13, M1-A12, Ml-Sll, M1-S10, M1-E9, M1-N8, and/or Ml- 
G7 of SEQ ID NO:2. Polynucleotide sequences encoding these polypeptides are also 
included in SEQ ID NO:l . The present invention also encompasses the use of these C- 
terminal HGPRBMY4 deletion polypeptides as immunogenic and/or antigenic 
epitopes as described elsewhere herein. 

[03 1 0] Alternatively, preferred polypeptides of the present invention may 

comprise polypeptide sequences corresponding to, for example, internal regions of the 
HGPRBMY4 polypeptide (e.g., any combination of both N- and C- terminal 
HGPRBMY4 polypeptide deletions) of SEQ ID NO:2. For example, internal regions 
could be defined by the equation: amino acid NX to amino acid CX, wherein NX 
refers to any N-terminal deletion polypeptide amino acid of HGPRBMY4 (SEQ ID 
NO:2), and where CX refers to any C-termrnal deletion polypeptide amino acid of 
HGPRBMY4 (SEQ ID NO:2). Polynucleotides encoding these polypeptides are also 
included in SEQ ID NO: 1. The present invention also encompasses the use of these 
polypeptides as an immunogenic and/or antigenic epitope as described elsewhere 
herein. 

EXAMPLE 10 

METHOD OF ENHANCING THE BIOLOGICAL ACTIVITY OR FUNCTIONAL 
CHARACTERISTICS THROUGH MOLECULAR EVOLUTION. 

[0311] Although many of the most biologically active proteins known are 

highly effective for their specified function in an organism, they often possess 
characteristics that make them undesirable for transgenic, therapeutic, pharmaceutical, 
and/or industrial applications. Among these traits, a short physiological half-life is the 
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most prominent problem, and is present either at the level of the protein, or the level of 
the proteins mRNA. The ability to extend the half-life, for example, would be 
particularly important for a proteins use in gene therapy, transgenic animal production, 
the bioprocess production and purification of the protein, and use of the protein as a 
chemical modulator among others. Therefore, there is a need to identify novel variants 
of isolated proteins possessing characteristics which enhance their application as a 
therapeutic for treating diseases of animal origin, in addition to the proteins 
applicability to common industrial and pharmaceutical applications. 
[0312] Thus, one aspect of the present invention relates to the ability to 

enhance specific characteristics of invention through directed molecular evolution. 
Such an enhancement may, in a non-limiting example, benefit the inventions utility as 
an essential component in a kit, the inventions physical attributes such as its solubility, 
structure, or codon optimization, the inventions specific biological activity, including 
any associated enzymatic activity, the proteins enzyme kinetics, the proteins Ki, Kcat, 
Km, Vmax, Kd, protein-protein activity, protein-DNA binding activity, 
antagonist/inhibitory activity (including direct or indirect interaction), agonist activity 
(including direct or indirect interaction), the proteins antigenicity (e.g., where it would 
be desirable to either increase or decrease the antigenic potential of the protein), the 
immunogenicity of the protein, the ability of the protein to form dimers, trimers, or 
multimers with either itself or other proteins, the antigenic efficacy of the invention, 
including its subsequent use a preventative treatment for disease or disease states, or as 
an effector for targeting diseased genes. Moreover, the ability to enhance specific 
characteristics of a protein may also be applicable to changing the characterized 
activity of an enzyme to an activity completely unrelated to its initially characterized 
activity. Other desirable enhancements of the invention would be specific to each 
individual protein, and would thus be well known in the art and contemplated by the 
present invention. 

[0313] For example, an engineered G-protein coupled receptor may be 

constitutively active upon binding of its cognate ligand. Alternatively, an engineered 
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G-protein coupled receptor may be constitutively active in the absence of ligand 
binding. In yet another example, an engineered GPCR may be capable of being 
activated with less than all of the regulatory factors and/or conditions typically 
required for GPCR activation (e.g., ligand binding, phosphorylation, conformational 
changes, etc.). Such GPCRs would be useful in screens to identify GPCR modulators, 
among other uses described herein. 

[0314] Directed evolution is comprised of several steps. The first step is to 

establish a library of variants for the gene or protein of interest. The most important 
step is to then select for those variants that entail the activity you wish to identify. The 
design of the screen is essential since your screen should be selective enough to 
eliminate non-useful variants, but not so stringent as to eliminate all variants. The last 
step is then to repeat the above steps using the best variant from the previous screen. 
Each successive cycle, can then be tailored as necessary, such as increasing the 
stringency of the screen, for example. 

[0315] Over the years, there have been a number of methods developed to 

introduce mutations into macromolecules. Some of these methods include, random 
mutagenesis, "error-prone" PCR, chemical mutagenesis, site-directed mutagenesis, and 
other methods well known in the art (for a comprehensive listing of current 
mutagenesis methods, see Maniatis, Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Press, Cold Spring, NY (1982)). Typically, such methods have been 
used, for example, as tools for identifying the core functional region(s) of a protein or 
the function of specific domains of a protein (if a multi-domain protein). However, 
such methods have more recently been applied to the identification of macromolecule 
variants with specific or enhanced characteristics. 

[031 6] Random mutagenesis has been the most widely recognized method to 

date. Typically, this has been carried out either through the use of "error-prone" PCR 
(as described in Moore, J., et al, Nature Biotechnology 14:458, (1996), or through the 
application of randomized synthetic oligonucleotides corresponding to specific regions 
of interest (as descibed by Derbyshire, K.M. et al, Gene, 46:145-152, (1986), and Hill, 
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DE, et al, Methods Enzymol., 55:559-568, (1987). Both approaches have limits to the 
level of mutagenesis that can be obtained. However, either approach enables the 
investigator to effectively control the rate of mutagenesis. This is particularly 
important considering the fact that mutations beneficial to the activity of the enzyme 
are fairly rare. In fact, using too high a level of mutagenesis may counter or inhibit the 
desired benefit of a useful mutation. 

[031 7] While both of the aforementioned methods are effective for creating 

randomized pools of macromolecule variants, a third method, termed "DNA 
Shuffling", or "sexual PCR" (WPC, Stemmer, PNAS, 91:10747, (1994)) has recently 
been elucidated. DNA shuffling has also been referred to as "directed molecular 
evolution", "exon- shuffling", "directed enzyme evolution", "in vitro evolution", and 
"artificial evolution". Such reference terms are known in the art and are encompassed 
by the invention. This new, preferred, method apparently overcomes the limitations of 
the previous methods in that it not only propagates positive traits, but simultaneously 
eliminates negative traits in the resulting progeny. 

[0318] DNA shuffling accomplishes this task by combining the principal of in 

vitro recombination, along with the method of "error-prone" PCR. In effect, you begin 
with a randomly digested pool of small fragments of your gene, created by Dnase I 
digestion, and then introduce said random fragments into an "error-prone" PCR 
assembly reaction. During the PCR reaction, the randomly sized DNA fragments not 
only hybridize to their cognate strand, but also may hybridize to other DNA fragments 
corresponding to different regions of the polynucleotide of interest - regions not 
typically accessible via hybridization of the entire polynucleotide. Moreover, since the 
PCR assembly reaction utilizes "error-prone" PCR reaction conditions, random 
mutations are introduced during the DNA synthesis step of the PCR reaction for all of 
the fragments -further diversifying the potential hybridation sites during the annealing 
step of the reaction. 

[0319] A variety of reaction conditions could be utilized to carry-out the DNA 

shuffling reaction. However, specific reaction conditions for DNA shuffling are 
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provided, for example, in PNAS . 91:10747, (1994). Briefly, prepare the DNA 
substrate to be subjected to the DNA shuffling reaction. Preparation may be in the 
form of simply purifying the DNA from contaminating cellular material, chemicals, 
buffers, oligonucleotide primers, deoxynucleotides, RNAs, etc., and may entail the use 
of DNA purification kits as those provided by Qiagen, Inc., or by the Promega, Corp., 
for example. 

[0320] Once the DNA substrate has been purified, it would be subj ected to 

Dnase I digestion. About 2-4ug of the DNA substrate(s) would be digested with .0015 
units of Dnase I (Sigma) per ul in lOOul of 50mM Tris-HCL, pH 7.4/lmM MgC12 for 
10-20 min. at room temperature. The resulting fragments of 10-50bp could then be 
purified by running them through a 2% low-melting point agarose gel by 
electrophoresis onto DE81 ion-exchange paper (Whatman) or could be purified using 
Microcon concentrators (Amicon) of the appropriate molecular weight cuttoff, or 
could use oligonucleotide purification columns (Qiagen), in addition to other methods 
known in the art. If using DE81 ion-exchange paper, the 10-50bp fragments could be 
eluted from said paper using 1M NaCL, followed by ethanol precipitation. 
[0321] The resulting purified fragments would then be subjected to a PCR 

assembly reaction by re-suspension in a PCR mixture containing: 2mM of each dNTP, 
2.2mM MgC12, 50 mM KC1, lOmM Tris«HCL, pH 9.0, and 0.1% Triton X-100, at a 
final fragment concentration of 10-30ng/ul. No primers are added at this point. Taq 
DNA polymerase (Promega) would be used at 2.5 units per lOOul of reaction mixture. 
A PCR program of 94 C for 60s; 94 C for 30s, 50-55 C for 30s, and 72 C for 30s using 
30-45 cycles, followed by 72 C for 5min using an MJ Research (Cambridge, MA) 
PTC-150 thermocycler. After the assembly reaction is completed, a 1 :40 dilution of 
the resulting primerless product would then be introduced into a PCR mixture (using 
the same buffer mixture used for the assembly reaction) containing 0.8um of each 
primer and subjecting this mixture to 15 cycles of PCR (using 94 C for 30s, 50 C for 
30s, and 72 C for 30s). The referred primers would be primers corresponding to the 
nucleic acid sequences of the polynucleotide(s) utilized in the shuffling reaction. Said 
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primers could consist of modified nucleic acid base pairs using methods known in the 
art and referred to else where herein, or could contain additional sequences (i.e., for 
adding restriction sites, mutating specific base-pairs, etc.). 
[0322] The resulting shuffled, assembled, and amplified product can be 

purified using methods well known in the art (e.g., Qiagen PCR purification kits) and 
then subsequently cloned using appropriate restriction enzymes. 

[0323] Although a number of variations of DNA shuffling have been published 

to date, such variations would be obvious to the skilled artisan and are encompassed by 
the invention. The DNA shuffling method can also be tailered to the desired level of 
mutagenesis using the methods described by Zhao, et al. fNucl Acid Res ., 25(6): 1307- 
1308,(1997). 

[0324] As described above, once the randomized pool has been created, it can 

then be subjected to a specific screen to identify the variant possessing the desired 
characteristic(s). Once the variant has been identified, DNA corresponding to the 
variant could then be used as the DNA substrate for initiating another round of DNA 
shuffling. This cycle of shuffling, selecting the optimized variant of interest, and then 
re-shuffling, can be repeated until the ultimate variant is obtained. Examples of model 
screens applied to identify variants created using DNA shuffling technology may be 
found in the following publications: J. C, Moore, et al., J. Mol. Biol .. 272:336-347, 
(1997), F.R., Cross, et al., Mol. Cell. Biol .. 18:2923-2931, (1998), and A. Crameri., et 
al.. Nat. Biotech .. 15:436-438, (1997). 

[0325] DNA shuffling has several advantages. First, it makes use of beneficial 

mutations. When combined with screening, DNA shuffling allows the discovery of 
the best mutational combinations and does not assume that the best combination 
contains all the mutations in a population. Secondly, recombination occurs 
simultaneously with point mutagenesis. An effect of forcing DNA polymerase to 
synthesize full-length genes from the small fragment DNA pool is a background 
mutagenesis rate. In combination with a stringent selection method, enzymatic activity 
has been evolved up to 16,000 fold increase over the wild-type form of the enzyme. In 
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essence, the background mutagenesis yielded the genetic variability on which 
recombination acted to enhance the activity. 

[0326] A third feature of recombination is that it can be used to remove 

deleterious mutations. As discussed above, during the process of the randomization, 
for every one beneficial mutation, there may be at least one or more neutral or 
inhibitory mutations. Such mutations can be removed by including in the assembly 
reaction an excess of the wild-type random-size fragments, in addition to the random- 
size fragments of the selected mutant from the previous selection. During the next 
selection, some of the most active variants of the polynucleotide/polypeptide/enzyme, 
should have lost the inhibitory mutations. 

[0327] Finally, recombination enables parallel processing. This represents a 

significant advantage since there are likely multiple characteristics that would make a 
protein more desirable (e.g. solubility, activity, etc.). Since it is increasingly difficult 
to screen for more than one desirable trait at a time, other methods of molecular 
evolution tend to be inhibitory. However, using recombination, it would be possible to 
combine the randomized fragments of the best representative variants for the various 
traits, and then select for multiple properties at once. 

[0328] DNA shuffling can also be applied to the polynucleotides and 

polypeptides of the present invention to decrease their immunogenicity in a specified 
host. For example, a particular varient of the present invention may be created and 
isolated using DNA shuffling technology. Such a variant may have all of the desired 
characteristics, though may be highly immunogenic in a host due to its novel intrinsic 
structure. Specifically, the desired characteristic may cause the polypeptide to have a 
non-native strucuture which could no longer be recognized as a "self molecule, but 
rather as a "foreign", and thus activate a host immune response directed against the 
novel variant. Such a limitation can be overcome, for example, by including a copy of 
the gene sequence for a xenobiotic ortholog of the native protein in with the gene 
sequence of the novel variant gene in one or more cycles of DNA shuffling. The 
molar ratio of the ortholog and novel variant DNAs could be varied accordingly. 
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Ideally, the resulting hybrid variant identified would contain at least some of the 
coding sequence which enabled the xenobiotic protein to evade the host immune 
system, and additionally, the coding sequence of the original novel varient that 
provided the desired characteristics. 

[0329] Likewise, the invention encompasses the application of DNA shuffling 

technology to the evolution of polynucletotides and polypeptides of the invention, 
wherein one or more cycles of DNA shuffling include, in addition to the gene template 
DNA, oligonucleotides coding for known allelic sequences, optimized codon 
sequences, known variant sequences, known polynucleotide polymorphism sequences, 
known ortholog sequences, known homolog sequences, additional homologous 
sequences, additional non-homologous sequences, sequences from another species, 
and any number and combination of the above. 

[0330] In addition to the described methods above, there are a number of 

related methods that may also be applicable, or desirable in certain cases. 
Representative among these are the methods discussed in PCT applications WO 
98/31700, and WO 98/32845, which are hereby incorporated by reference. 
Furthermore, related methods can also be applied to the polynucleotide sequences of 
the present invention in order to evolve invention for creating ideal variants for use in 
gene therapy, protein engineering, evolution of whole cells containing the variant, or in 
the evolution of entire enzyme pathways containing polynucleotides of the invention 
as described in PCT applications WO 98/13485, WO 98/13487, WO 98/27230, WO 
98/31837, and Crameri, A., et al., Nat. Biotech ., 15:436-438, (1997), respectively. 
[0331] Additional methods of applying "DNA Shuffling" technology to the 

polynucleotides and polypeptides of the present invention, including their proposed 
applications, may be found in US Patent No. 5,605,793; PCT Application No. WO 
95/22625; PCT Application No. WO 97/20078; PCT Application No. WO 97/35966; 
and PCT Application No. WO 98/42832; PCT Application No. WO 00/09727 
specifically provides methods for applying DNA shuffling to the identification of 
herbicide selective crops which could be applied to the polynucleotides and 
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polypeptides of the present invention; additionally, PCT Application No. WO 
00/12680 provides methods and compositions for generating, modifying, adapting, and 
optimizing polynucleotide sequences that confer detectable phenotypic properties on 
plant species; each of the above are hereby incorporated in their entirety herein for all 
purposes. 

[0332] The contents of all patents, patent applications, published PCT 

applications and articles, books, references, reference manuals and abstracts cited 
herein are hereby incorporated by reference in their entirety to more fully describe the 
state of the art to which the invention pertains. 

[0333] As various changes can be made in the above-described subject matter 

without departing from the scope and spirit of the present invention, it is intended that 
all subject matter contained in the above description, or defined in the appended 
claims, be interpreted as descriptive and illustrative of the present invention. Many 
modifications and variations of the present invention are possible in light of the above 
teachings. 
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